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visit Xsystems.com/mini today. 
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EDITORS’ WORD 





Dear Readers, 


We hope you have been doing well. Holiday season 
has started, both public holidays and vacation. How 
was the 4th of July in the USA and Canada’s day? 
1st of July? Did you have a great time with your 
friends and families? If you are not based in North 
America, let us Know what and when is the most im- 
portant holiday in your country! 


We all love holidays, but let’s dive into this month’s 
issue. As always, we will start with news from the in- 
dustry. There has been a lot going on in the open 
source world. 


Mikhail Zakharow shared the third article from his 
storage series. This time you will read “Implementing 
In-Memory Cache in the BeaST Architecture”. It’s the 
next step in building a perfect, dual-headed FreeBSD 
storage concept, but no worries, there will be more in 
an upcoming issue. 


If you have ever had a problem with MySQL Server 
Startup on FreeBSD, Andrey Ferriyan can help you 
out with his article. 


Our expert, E.G. Nadhan from Red Hat wrote a great 
article about the open source community. Let us 
know your thoughts about your favorite benefit from 
using open source. Nicolas de Loof shared how to do 
a “Docker Cleanup” and Adrian Rosebrock his “Top 
(not) 9 Favorite Python Deep Learning Libraries” with 
us. 


If you have always wanted to start your journey with 
FreeNAS, take a look at Mark Von Fange’s second 
article about “FreeNAS Getting Started Guide: Part 2, 
The Initial Configuration Wizard”. Pure knowledge 
from the FreeNAS source. 


To wrap this issue up, we have a great interview with 
Aleksandar S. Sokolovski, Founder of SciBi; and, as 
always, Rob Somerville gets right to the point in his 
column. 


Enjoy your holidays and let us know what topics you 
would like us to cover in upcoming issues. 


Marta & BSD Team 
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News 


BSD World Monthly News 5 





by Marta Ziemianowicz 


This column presents the latest news coverage of 
events, product releases and trending topics. 


FreeBSD Corner 


Implementing In-Memory Cache in the BeaST Ar- 
chitecture 18 


by Mikhail E. Zakharov 


Recently we have created the reliable, dual-headed 
FreeBSD based storage concept. Then we improved 
it with the famous ZFS feature. Now we will try to im- 
plement a shared in-memory cache to make our 
model even more serious. 


MySQL Server 5.7 Startup Problem on FreeBSD 
10.2 35 


by Andrey Ferriyan 


I’ve just installed MySQL server 5.7 for FreeBSD 10.2 
using pkg package manager. It seems the problem 
occurred when | went to start MySQL for the first 
time. This time I’m using the real VPS server and not 
VirtualBox. 


Expert Says 


Let Us Start at the Very Beginning with Open 
Source 40 


by E.G. Nadhan 


The Red Hat coloring booth at SX Create during the 
South by Southwest (SXSW) conference and festival 
helped to tell the open source story of collaboration in 
a way that was simple enough for children to under- 
stand. It was a booth where new ideas manifested 
themselves in the form of colorful shapes and forms 
on the walls. Thousands of attendees contributed, 
bringing the 12 foot tall space to life. Ideas were ex- 
panded on by other children, modified or cloned by 
some to give birth to other related ideas. Continuous, 
enthusiastic collaboration at its peak in an environ- 


ment ripe for open communication -- and thus, innova- 
tion. Sound familiar? 


Docker 
Docker Cleanup 43 
by Nicolas De Loof 


Most Docker newcomers are disappointed when, at 
some time, after various experiments with Docker, 
they hit a no space left on device issue. Docker does 
indeed store all containers, images and volumes in 
/var/1lib/docker, which can quickly grow to giga- 
bytes. 


Python 


My Top 9 Favorite Python Deep Learning Librar- 
ies 46 


by Adrian Rosebrock 

So you're interested in deep learning and Convolu- 
tional Neural Networks. But where do you start? 
Which library do you use? There are just so many! 
This list is by no means exhaustive, it’s simply a list of 


libraries that I've used in my computer vision career 
and found particularly useful at one time or another. 


FreeNAS 


FreeNAS Getting Started Guide: Part 2, The Initial 


Configuration Wizard 57 
by Mark VonFange 


This article series is intended to serve as an introduc- 
tory guide to assist FreeNAS users in planning, instal- 
lation, configuration and administration for their Fre- 
eNAS storage systems. This month’s article will cover 
getting your FreeNAS set up using the Initial Configu- 
ration Wizard. 


Interview 


Interview with Aleksandar S. Sokolovski, Co- 
Founder of Sci Bi 69 





by Marta Ziemianowicz, Marta Strzelec & Marta Si- 
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Rob’s Column 78 





by Rob Somerville 


With the recent British referendum decision to exit 
from the Europe Union, one of the few opinion polls 
to correctly predict the outcome was online. What im- 
pact is the Internet having on the established politi- 
cal order and is this a force for good? 
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PC-BSD's Lumina Desktop 
BSD Certification Now In Beta For v1.0 





The Lumina Desktop Environment has made 


The BSD Certification Group Inc available their v1.0 beta release of the Qt- 
(BSDCG) is a non-profit organization written desktop. 

committed to creates and : PC-BSD developers and others continue 
main tauiing 2 gen certification working on Lumina as an alternative, light- 
standard for systam administration weight desktop environment. While originat- 
on BSD based operating systems. ing in the BSD world, Lumina continues to be 


designed to work on any Unix-like OS and is 
licensed under a 3-clause BSD license. 
@ WHAT CERTIFICATIONS ARE AVAILABLE? Should you not be familiar with Lumina from 


our past articles, visit Lumina-Desktop.org to 
BSDA: Entry-level certification suited for candidates P ped 


with a general Unix background and at least six months of learn more about the project. 
experience with BSD systems. 
Lumina 1.0-Beta 1 was released to start off 


BSDP: Advanced certification for senior system administrators July. The release announcement for this ver- 


with at least three years of experience on BSD systems. 


Successful BSDP candidates are able to demonstrate sion simply reads, "This is a beta-quality re- 
strong to expert skills in BSD Unix system administration. lease of the Lumina Desktop Environment 

(1.0.0-Devel, snapshot on July 1 2016). Al- 
g WHERE CAN IGET CERTIFIED? though highly stable and ready for use on 

most desktop systems, many backend sys- 
We're pleased to announce that after 7 months of tems are still changing which may cause the 
negotiations and the work required to make the exam ' ‘ ‘ 
available in a computer based format, that the BSDA Users Seung= to be ieee between vere 
exam is now available at several hundred testing centers (although this is minimized as much as possi- 
around the world. Paper based BSDA exams cost $75 USD. ble)." 


Computer based BSDA exams cost $150 USD. The price of 
EO ee ee http:/www.phoronix.com/scan.php ?page=ne 
Payments are made through our registration website: ws_item&px=Lumina-1.0-Beta 


https://register.bsdcertification.org//register/payment 


@& WHERE CAN | GET MORE INFORMATION? 


More information and links to our mailing lists, LinkedIn 
groups, and Facebook group are available at our website: 
http://www.bsdcertification.org 


Registration for upcoming exam events is available at our 
registration website: 
https://register.bsdcertification.org//register/get-a-bsdcg-id 
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Based on the principles of tangible programming, 
Google has devised a new learning tool called ‘Pro- 
ject Bloks’. With the help of three components, Bloks 
creates a ‘physical’ program that teaches the coding 
basics to young learners. 


Google Project Bloks 


Fe SES 


Google calls programming a way to develop creativ- 
ity and computational thinking. This view is sup- 
ported by different governments from all around the 
world — some of them have even included coding as 
a compulsory subject in school curriculums. 





Learn programing basics with fun 


In its latest efforts to make coding more accessible and easier, Google has partnered with the 
Stanford University and design firm IDEO to start a new initiative called ‘Project Bloks‘. 


It’s an open platform created to help the kids learn programming. With the help of physical compo- 
nents, kids use ‘Bloks’ as a reference design to create a physical ‘program’ that controls real 
world objects. 


Specifically targeted at kids, this system consists of 3 core blocks — Brain Board, Base Board, 
and Pucks. 


The Raspberry Pi-based Brain Board acts as the main controller that also houses power and com- 
munication parts. 


The Base Boards is a small block that plays the role of a home that provides LED, haptic, and 
audio feedback. 


The little Pucks are control modules that are made to perform a single function, like on and off, 
increasing/decreasing volume, stop, go, etc. When connected to a base board or another case 
board, a puck sends the specific commands to the brain board. 


With Project Bloks, Google intends to teach kids the logic behind coding and simple fundamen- 
tals. With this physical coding initiative, kids can grasp the basic skills and transfer them to the 
real-world applications. 


Let’s hope that Google makes coding toys cheaper and develops Bloks into a powerful toy that 
developers can use to create customizable learning tools. 


http://fossbytes.com/google-project-bloks-coding/ 
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Mozilla Pushes Online Privacy with New O 





Mozilla is funneling yet more money into the open source ecosystem. This week, the organiza- 
tion, best known for the Firefox Web browser, announced an award of $385,000 to fund eight 
open source projects, including several important online privacy platforms. 


The awards are part of the Mozilla Open Source Support (MOSS) program. Alongside initiatives 
like the funding of gigabit Internet, MOSS is an example of how big-name open source founda- 
tions are pursuing new strategies for shaping the open source ecosystem with their cash re- 
serves. 


The awards announced this week will support the following projects: 

¢ Tor, which develops software for hiding your identity and location online. 

¢ Tails, a Linux-based operating system designed to provide out-of-the-box privacy and security. 
¢ Caddy, a Web server that delivers HTTPS encryption by default. 

¢ Mio, a software programming library. 


¢ DNSSEC/DANE Chain Stapling, which is working to improve the process of secure DNS re- 
quests. 


¢ Godot Engine, a game engine. 
¢ PeARS, a private Internet search engine. 
¢« NVDA, a screen reader for Windows. 


A number of these projects focus on improving online privacy and security. Mozilla is making 
pretty clear its commitment to helping to create a more private, decentralized Internet free of cen- 
sorship and tracking -- or at least one in which censorship and tracking are more difficult than 
they have become in most online contexts today. 


That should send a signal to other open source organizations and developers about the direction 
in which the money is flowing inside the open source ecosystem. Demand for better privacy and 
security delivered through open source code is high, and prestigious foundations like Mozilla are 
putting up the cash to reward developers who deliver those features. 


http://thevarguy.com/open-source-application-software-companies/mozilla-pushes-online-privacy- 


new-open-source-funding-awa 
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For those interested in the latest developments of ubun- 
tuBSD, the developer mentions that the official release 
will come bundled with BusyBox and OpenRC, sans 
systemd. 


In a series of tweets, ubuntuBSD project leader Jon 
Boden has announced a few of the technical features 
coming to the soon-to-be-released ubuntuBSD 16.04 
operating system. 





A week ago, we wrote an exclusive story to tell you that the first and major release of the ubun- 
tuBSD OS is coming soon, based on the Ubuntu 16.04 LTS (Xenial Xerus) distribution, but using 
the powerful kernel from the BSD-based FreeBSD 10.3 operating system. 


https://www.freebsdnews.com/2016/06/24/ubuntubsa-1 6-04-will-feature-a-combination-of-busybo 
x-and-openrc-but-no-systemd/Upstream 


Microsoft has announced that FreeBSD 10.3 is now 
available as a virtual machine image in the Azure Mar- 
ketplace. Azure is Microsoft’s cloud computing plat- 
form and infrastructure for deploying various applica- 
tions. 


Today, I’m excited to announce the availability of 
FreeBSD 10.3 as a ready-made VM image available 
directly from the Azure Marketplace. This means that 
not only can you quickly bring-up a FreeBSD VM in Azure, but also that in the event you need 
technical support, Microsoft support engineers can assist. 





Here’s how easy it is to get up and going through the Azure portal. Simply click on the +New on 
the left pane (or the marketplace tile on your dashboard), type “FreeBSD 10.3” in the search text 
box, and you're there. 


https://www.freebsdnews.com/2016/06/1 7/microsoft-unveils-freebsd-build-for-virtual-appliances-a 


mong-azure-updates/ 
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The fourth alpha release of the upcoming FreeBSD 11.0 is now available for testing. 


FreeBSD 11.0 Alpha 4 ships the very latest fixes for this major BSD update. FreeBSD 11.0 is 
scheduled to be officially released in early September with the code freeze happening last week, 
the beta builds beginning in July, and release candidates in August. The FreeBSD 11.0 schedule 
can be found via FreeBSD.org. 


FreeBSD 11.0 is bringing updated KMS drivers, Linux binary compatibility layer improvements, 
UEFI improvements, Bhyve virtualization improvements, and a wide range of other enhance- 
ments. FreeBSD 11.0 tentative release notes can be seen here for those wanting to learn more 
about this forthcoming OS update. 


The latest FreeBSD development builds can be found at FreeBSD.org FTP. 


http://www.phoronix.com/scan.php ?page=news_item&px=FreeBSD-11.0-Alpha-4 


Thanks to open source software, Powering Po- 
tential and the Raspberry Pi Foundation are 
able to bring computers and a library of digital 
education content to rural schools in the East Af- 
rican nation of Tanzania. Recently, the Founda- 
tion funded a project that is now distributing 
Raspberry Pi computers with uploaded educa- 
tional content alongside portable projectors and 
OO FF screens to 56 schools across the Zanzibar archi- 
pelago and two mainland regions of Tanzania. 
The Segal Family Foundation also provided matching funds, which enables the project to reach 
twice as many schools. 





With a five-fold increase in the number of students in the decade following 2003, Tanzania is 
struggling to provide more schools, classrooms, teachers, desks, and textbooks. Yet whenever 
you visit rural secondary schools in Tanzania, you will find eager girls and boys in roughly equal 
numbers outfitted in uniforms with ready smiles. 


https:/opensource.com/education/1 6/6/interview-janice-lathen-powering-potential 
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FreeBSD 11 Alpha 1 — New Features Coming To This Open 
Source OS 





FreeBSD foundation has announced the code freeze for the FreeBSD 11 release, marking the 
start of the Alpha 1 release. This release brings many features, especially for the system adminis- 
trators and super users. 


For those unfamiliar with FreeBSD, it is considered one of the few operating systems left to be 
true UNIX. It is a direct descendant of the BELL/AT&T labs UNIX. Much of the software available 
for Linux is also available for FreeBSD, including Gnome and KDE desktop environments and 
much more user and server software. Despite the amount of software available, it is often thought 
of as an obscure system with a rather small software library. 


This belief is simply untrue, and is less and less true every day due to FreeBSD being able to 
emulate many Linux system calls. This means much software developed specifically for Linux 
can be run on FreeBSD as well. 


Other operating systems in the same family are Mac OS X, OpenBSD, NetBSD, and Orbis OS, 
which is the operating system that runs on PlayStation 4 Gaming consoles. 


FreeBSD is a longstanding landmark in the open source community, and while it doesn’t have as 
large a user-base as Linux, what it lacks in popularity it surely makes up for in stability, security, 
and features. 


FreeBSD is used in cutting-edge enterprises and is used for the underlying infrastructure of many 
well-known products like Netflix, WhatsApp, and even as the base for network appliances made 
by Juniper as well as other companies. 


Among the biggest changes to come when FreeBSD 11 arrives in September are updates to: 
¢ The network stac 

¢ Improvements for ARM support 

¢ Several updates to the bhyve hypervisor 

¢ Many security updates 

¢ Storage subsystem updates and an upgraded version of OpenZFS 


Are any fossBytes readers FreeBSD users? If so, let us know below in the comments what you 
think. If not, let us Know why not or if you plan on trying it out sometime soon. 


http:/fossbytes.com/freebsd-11-release-notes-features/ 
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NodeJS, Ruby, PHP, Java 





This disclosure of an unpatched Remote Code Exec flaw in the Swagger API framework compro- 
mises NodeJS, Ruby, PHP, and Java. 


Swagger is a representation of RESTful API that allows developers to get interactive documenta- 
tion, client SDK generation and discoverability. 


The Swagger generators are privileged tools for organisations to offer developers easy access to 
their APIs. 


Currently, the Swagger APIs help companies like Apigee, Getty Images, Intuit, LivingSocial, 
McKesson, Microsoft, Morningstar, and PayPal in building services with RESTful APIs. 


Now an unpatched remote code execution vulnerability (CVE-2016-5641) in the Swagger API 
framework, affecting both client and server components, has been publicly disclosed. 


The security vulnerability exists in code generators within the OpenAPI Specification, the REST 
programming tool. 


“The Open API Initiative (OAI) was created by a consortium of forward-looking industry experts 
who recognize the immense value of standardizing on how REST APIs are described.” states the 
official description. 


The remote code execution vulnerability is easy to exploit due to the availability of a Metasploit 
module released by the security researcher Scott Davis. Davis explained that injectable parame- 
ters in Swagger JSON or YAML files allow attackers to remotely execute code across NodeJS, 
PHP, Ruby, and Java. Davis highlighted that other code generation tools may be vulnerable to pa- 
rameter injection attacks. 


“This disclosure will address a class of vulnerabilities in a Swagger Code Generator in which in- 
jectable parameters in a Swagger JSON or YAML file facilitate remote code execution. This vul- 
nerability applies to NodeJS, PHP, Ruby, and Java and probably other languages as well.” Davis 
wrote in a blog post published on the Rapid7 community. “Other code generation tools may also 
be vulnerable to parameter injection and could be affected by this approach. By leveraging this 
vulnerability, an attacker can inject arbitrary execution code embedded with a client or server gen- 
erated automatically to interact with the definition of service. This is considered an abuse of trust 
in definition of service, and could be an interesting space for further research.” 


Davis explained that attackers can exploit specially crafted Swagger documents to dynamically 
create HTTP API clients and servers with embedded arbitrary code execution in the underlying 
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GOING SERVERLESS WITH AWS + SWAGGER 


operating system. The attack relies on the lack of proper sanitization of the parameters within a 
Swagger document. 


“This is achieved by the fact that some parsers/generators trust insufficiently sanitized parame- 
ters within a Swagger document to generate a client code base. 


¢ On the client side, a vulnerability exists in trusting a malicious Swagger document to create any 
generated code base locally, most often in the form of a dynamically generated API client. 


¢ On the server side, a vulnerability exists in a service that consumes Swagger to dynamically 
generate and serve API clients, server mocks and testing specs.” 


The bad news is that the flaw is still unpatched despite it being publicly disclosed; last month the 
US-CERT issued a specific alert and experts from Rapid 7 already devised a fix. 


Rapid7 first attempted to contact the maintainers of the Swagger project in April, exactly one 
week ago, on June 16, it provided to the US-CERT a patch. The Metasploit module was released 
on the date of public disclosure, June 23. 


Waiting for the release of the patch by the maintainers, users need to carefully inspect Swagger 
documents for language-specific escape sequences. 


http://securityaffairs. co/wordpress/48679/hacking/swagger-rce-flaw.html 
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The S.2852 OPEN Government Data Act aims to require true open data access at the federal 
level. In this article, | will discuss the importance of open data in government, the current state of 
open data in government, and what we need to do to implement true open data. 


When | read an article on the Center for Data Innovation site, Congress Is Stepping Up to Protect 
Open Data, | was struck by two feelings: elation and surprise. The article explains: 


Sponsored by Representatives Derek Kilmer (D-WA) and Blake Farenthold (R-TX) in the House, 
and Senators Brian Schatz (D-Hl) and Ben Sasse (R-NE) in the Senate, the bill would make 
changes to the U.S. Code to institutionalize open data best practices, such as publishing govern- 
ment data, by default using open and machine readable formats, and with an open license that 
imposes no restrictions on reuse. "Open by default" has been a mainstay of the open data move- 
ment for years, and for good reason: there is simply no way to reliably foresee the potential value 
of every government data set until the public has access to it. 


The article links to a 2013 McKinsey report, Open data: Unlocking innovation and performance 
with liquid information, that says the national open data initiative of "open by default" represents 
real economic and transparency value for the public. The report points out that there is no assur- 
ance that open data will survive the end of the current administration. 


What are the opportunities in legislation for open data within open government? 


The how and why of open data was first discussed in the 8 Principles of Open Government Data, 
published in 2007, and it has increased its importance in government, civil society, and develop- 
ers' communities. In 2009, President Obama, on his first day in office, issued his first executive 
order requiring agencies to identify and release "high value" data sets. This had an unintended 
consequence of some agencies participating and flooding data.gov with what were arguably "high 
value" data sets and some not participating. In 2013, M1313 Executive Order mandated that all 
data be open "by default." But did that really happen? 


President Obama said of his Executive Order: Making Open and Machine Readable the New De- 
fault for Government Information: 


"My Administration is committed to creating an unprecedented level of openness in Government. 
We will work together to ensure the public trust and establish a system of transparency, public 
participation, and collaboration. Openness will strengthen our democracy and promote efficiency 
and effectiveness in Government.” 


One of my open data colleagues, Dr. Dennis D. McDonald, has a concise description of the prob- 
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lem. He says that unless we address a few questions about open data, the effort will fail: 


¢ How does making open data relate to or support the goals and objectives of the government 
programs that generate the data? 


¢ How much will open data cost? 
¢ Who will pay? 


The executive orders, albeit well intentioned and ground-breaking, did not deliver on the prom- 
ises made, and executive orders are easy to reverse. Legislation is a stronger approach. 


https://opensource.com/government/16/6/us-open-data-legislation 


Microsoft Releases Open Source .NET Core 1.0 For Linux, Win- 


dows, And macOS 





Microsoft has released the .NET Core 1.0 for Linux, Windows, and macOS. This announcement 
adds a new chapter in Microsoft’s love affair with Linux and open source. While Red Hat’s RHEL 
Linux distribution currently supports .NET Core 1.0, Microsoft is working to bring the same ability 
to Debian, Ubuntu, and CentOS. 


Back in November 2014, Microsoft promised to make more Core products open source. Now, 
making good on its promise, Microsoft has finally announced the Windows, Linux, and macOS 
versions of .NET Core 1.0 and ASP .NET Core 1.0. 


The latest open release also includes the release of the .NET Standard Library, making the proc- 
ess of reusing the code across different types of devices like PCs, iOS, and Android much easier. 


These announcements were made at Red Hat’s tech conference in San Francsisco. Notably, Red 
Hat and Microsoft are often considered fierce rivals in the cloud and enterprise world. 


The .NET Core 1.0 is a result of the contribution of more than 18,000 developers and 1,300 com- 
panies. Out of these companies, Microsoft's .NET Core 1.0 is officially supported on Red Hat’s 
RHEL Linux distribution and OpenShift. 


This partnership with the enterprise-friendly Linux distro will allow enterprises to run different 
microservices-based apps that are made up of both .NET and Java components. 
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Microsoft is also working to make .NET Core available for Debian, Ubuntu, and CentOS distribu- 
tion. The announcement also includes the news that Samsung is joining the .NET Foundation 
Steering Committee. 


“This is the biggest transformation of .NET since its inception and will define .NET for the next 
decade,” Microsoft said. 


The same summit also witnessed the release of a new Azure Resource Manager template to 
make the process of deploying OpenShift on RHEL a simple task. 


For more information, you can visit Microsoft's .NET Blog 


http:/fossbytes.com/microsoft-releases-open-source-net-core-1-linux-windows-macos/ 


iXsystems' TrueNAS Firmware Update Delivers Com 


formance, Re i Improvements 





TrueNAS 9.10 Release Provides up to 40% Improved Performance and More Options to Repli- 
cate Data and Graph System Telemetry 


SAN JOSE, CA--(Marketwired - July 05, 2016) - Enterprise storage vendor iXsystems today an- 
nounced the release of Version 9.10 of its TrueNASenterprise storage array firmware. TrueNAS 
9.10 represents the third generation of core software for the award-winning line of enterprise stor- 
age arrays and is available to all new and existing TrueNAS users. At the center of TrueNAS 9.10 
is an update to the FreeBSD 10.3 operating system which brings significant performance improve- 
ments and provides the foundation for the next generation of network fabrics including 100GbE 
Ethernet. 


"My lab benchmarks show TrueNAS 9.10 serves file and block data up to 40% faster than True- 
NAS 9.3, especially with higher-end fabrics like 10 and 40GbE. These results should translate to 
noticeable real-world gains and are just the beginning of what we can do with this solid new plat- 
form," said Josh Paetzel, iXsystems Senior Storage Architect. 


In addition to performance gains for the SMB, NFS, AFP, iSCSI and Fibre Channel protocols, Tru- 
eNAS 9.10 also introduces bidirectional TrueSync replication which allows primary and backup 
storage hosts to efficiently reverse roles, bringing the other host up to date as needed. This new 
feature is just one of the many improvements to the TrueNAS High Availability and replication ar- 
chitecture to deliver faster, more resilient live failovers in service of upgrades or outages. True- 
Sync replication is available between TrueNAS systems on the same LAN or across the globe. 
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For users who actively monitor their storage infrastructure, the TrueNAS 9.10 update expands 
TrueNAS performance telemetry graphing to include iSCSI and Fibre Channel interfaces, plus the 
ability to visualize telemetry data on a local Grafana or Graphite server. TrueNAS provides the 
user with 50 built-in metrics through its web interface and now exposes over 100 metrics to an ex- 
ternal server for best-in-class storage system performance analytics. 


"TrueNAS 9.10 brings with it a new operating system base that incorporates many new features 
and performance improvements, all while maintaining its hallmark of stability," says Kris Moore, 
Director of FreeNAS and TrueNAS Development for iXsystems. "The new FreeBSD 10.3 base 
will provide the platform for TrueNAS to continue to grow its feature-set in 2016 and beyond." 


TrueNAS updates are available through the TrueNAS software updater, a component of the True- 
NAS administration interface. TrueNAS users will be notified of the availability of the TrueNAS 
9.10 update and should contact iXsystems Technical Support if they have any questions. 


To learn more about TrueNAS send an email to sales@iXsystems.com, call 1-855-GREP-4IX, or 
visit www.iXsystems.com/TrueNAS. 


Source: iXsystems Newsletter 
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Implementing In-Memory Cache in 
the BeaST Architecture 


by Mikhail E. Zakharov 


Recently we have created the reliable, dual-headed FreeBSD 

based storage concept. Then we improved it with the fa- 
mous ZFS feature. Now we will try to implement a shared in- 

memory cache to make our model even more serious. 


In the enterprise systems with the real hardware controllers, we should think of connecting the 
storage cache directly with the high-speed buses that are shared between the controllers. But in 
our virtual environment, actually, this is still an Oracle VM VirtualBox on my laptop, we have to 
use LAN for every interconnection. Earlier we decided to stop on the iSCSI protocol for our sim- 


ple SAN solution. 


In general, the virtual environment is not changed much: two machines (ctrl-a and ctrl-b) are the 
storage controllers and one is the client (clnt-1). But in the previous case with ZFS tests, we had 
to add four additional virtual drives to emulate SSDs as the storage space for the cache. We do 
not need them any more because now we intend to use RAM for this purpose. Therefore, we can 
easily copy our previous virtual machine configurations with the exception of the fxx drives. 


The configuration summary is shown in the table below: 






Inter-controller 
(private) network. 
Host-only adapter 
(vboxnet0) 

Public network. Host- 
only adapter 
(vboxnett1) 








IP: 192.168.56.10 


Mask: 255.255.255.0 


IP: 192.168.55.10 


Mask: 255.255.255.0 






IP: 192.168.56.11 






Mask: 255.255.255.0 






IP: 192.168.55.11 






Mask: 255.255.255.0 













IP: 192.168.55.20 


Mask: 255.255.255.0 
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Base memory 2048 MB or more 


2048 MB or more 


Any appropriate value 
starting with 512 MB 
will do 





Shareable, fixed-sized 
virtual drives for ZFS 
data volumes on the 


d0o, d01, d10, d11 — 
each drive is 100 MB 
or more 


doo, d01, d10, d11 — 
each drive is 100 MB 
or more 


SATA controller. 





At least 5 GB to store 
FreeBSD 10.3- 
Release default 
installation 


System virtual drives 
(Dynamic-sized) on 
the IDE controller 


At least 5 GB to store 
FreeBSD 10.3- 
Release default 
installation 


At least 5 GB to store 
FreeBSD 10.3- 
Release default 
installation 

















Note that it's still important to create data-drives (d00, d01, d10, d11) with the fixed-sized and 
shareable options enabled (see Oracle VM VirtualBox Virtual Media Manager for the details). Oth- 
erwise, we will be unable to connect these drives simultaneously to both machines (ctrl-a and 
ctrl-b) and emulate the drive enclosure that is shared between the storage controllers. 


Also, | will continue to use a USB memory-stick to store fixed-sized shareable drives for our test- 
ing. But this is being done only in order to slowdown I/O operations and see what is going on in 
the model. 


The latest FreeBSD 10.3 Release can be installed on the non-shareable ada0O drives of the virtual 
storage machines with the configuration changes in /etc/rc.conf, as usual for our project:. 






hostname="ctrl-a" hostname="ctrl-b" 







ifconfig_em0="inet 192.168.56.11 netmask 
255.255.255.0" # Inter-controller LAN 


ifconfig_em0="inet 192.168.56.10 netmask 
259.259.255.0" # Inter-controller LAN 


ifconfig_em1="inet 192.168.55.11 netmask 
255.255.255.0" # Public network 


ifconfig_em1="inet 192.168.55.10 netmask 
200.200.200.0 # Public network 
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sshd_enable="YES" sshd_enable="YES" 


# Set dumpdev to "AUTO" to enable crash # Set dumpdev to "AUTO" to enable crash 
dumps, "NO" to disable dumps, "NO" to disable 


dumpdev="AUTO" dumpdev="AUTO" 


# VirtualBox guest additions # VirtualBox guest additions 
vboxguest_enable="YES" vboxguest_enable="YES 


vboxservice_enable="YES" vboxservice_enable="YES" 


# iSCSI # iSCSI 
ctld_enable="YES" # Targets ctld_enable="YES" # target 


iscsid_enable="YES" # Initiators iscsid_enable="YES" # initiator 











Set iSCSI “disconnection on fail” kernel variable in /etc/sysctl.conf on both systems to be able to 
failover to the alive controller in case of disaster: 


KetiatoeGs eat POU econmecre ton | 





After finishing all these basic FreeBSD installations and preparations, we can start in-memory 
cache configuration. 


The in-memory cache 


As we did not write a kernel module with a device driver for our own cache algorithm, we will use 
ZFS caching capabilities to reach the goal. In the previous article, we have already learned how 
to configure ZIL and ARC/L2ZARC on the shared drives to suit our needs as write- and read- 
caches. In this part, we will try to allocate these caches in the RAM-drive spaces. 


And at this point, we have at least two major problems to solve. 


First is that the storage system with its in-memory cache is very sensitive to power loss. 
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Therefore, in real life, we must protect both of the controllers with independent electrical power 
sources and UPSs. The same requirement is applied to the drive enclosures, as well. Hopefully, 
now we are playing in the sandbox of the virtual lab without any useful data, so we should not 
worry about losing it. 


Second, we need to invent a synchronization mechanism to enable cache-mirroring feature be- 
tween the controllers. And in our limited environment, we have to use iSCSI protocol over the vir- 
tual LAN as the main transport for this interconnection. 


To accomplish this task, we will need to create four memory drives (md) on both controllers to 
maintain primary and secondary copies of the read- and write- caches and appropriate inter-layer 
devices — ggate (actually this is ggatel — a “local” version) to simplify ZFS management by using 
the unified device names. See the table below for the cache structure details: 




















md0O ggate0 ctrl-a ctrl-a write-cache ZFS ZIL primary copy 

md‘ ggate1 ctrl-a ctrl-a read-cache ZFS L2ARC primary copy 

md2 ggate2 (on ctrl-a ctrl-b write-cache ZFS ZIL secondary copy 
gmirror)* 

md3 ggate3 (on ctrl-a ctrl-b read-cache ZFS L2ARC secondary 
gmirror)* copy 

md0O ggateO (on ctrl-b ctrl-a write-cache ZFS ZIL secondary copy 
gmirror)* 

md1 ggate1 (on ctrl-b ctrl-a read-cache ZFS L2ARC secondary 
gmirror)* copy 

md2 ggate2 ctrl-b ctrl-b write-cache ZFS ZIL primary copy 

md3 ggate3 ctrl-b ctrl-b read-cache ZFS L2ARC primary copy 














*We cant do it right on this step, therefore, see the gmirror configuration on the next page. 
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Now we can implement this structure by the commands: 










mdcont 1.¢ mdcontirg =—a 





MACONTLG: =e =F ewap ‘=e LZem =a 1 MaConliG- -a -—— Swap. -s JZzem —u 2 
maconrig —a -t swap —s 120m —u 2 mdconfig -a -t swap -s 128m -u 2 


mdconfig -a -t swap -s 128m -u 3 mdconfig -a =-t swap -s 176m =u 3 


ggatel create -t 1 -u 0 /dev/md0 ggatel create -t 1 -u 2 /dev/md2 


ggatel create -t 1 -u 1 /dev/mdl ggatel create -t 1 -u 3 /dev/md3 








We use eight 128 MB memory areas, which gives us 1 GB of cache divided between two control- 
lers. In other words, we have 256 MB of read-cache and 256 MB of write-cache. Actually, these 
sizes depend on the real memory of the controllers but the basic rule is simple: install more mem- 
ory and use as much cache as possible. 


Now we can establish iSCSI interconnections between cache areas of the opposite controllers. 
So edit /etc/ctld.conf on both controllers to enable iSCSl-targets. Let's use even LUNs (0 and 2) 
to access the write-cache and odd LUNs (1 and 3) for the read-cache: 










portal-group pgO { 





portal-croup: pdU: 4 


discovery-auth-group no- discovery-auth-group no- 
authentication authentication 


listen 192.168.56.10 listen 192.168.56.11 
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target ign. 
2016-01.local.sss.private:target0O 
{ 


auth-group no-authentication 


portal-gqroup pgd 


lun O { 


path /dev/md0 


Lum 1. 7 


path /dev/mdl 








target ign. 


2016-01.local.sss.private:target0O 


{ 


auth-group no-authentication 


portal-group pg0 


lun 0 4 


path /dev/md2 


lun 1 { 


path /dev/md3 


Tell ct1d to re-read its target configurations and then initiate iSCSI interconnections: 










killall -HUP ctld 





iscsictl -A -p 192.168.56.11 -t iqn. 
2016-01 .local.sss.private:targetO 
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killall -HUP ctld 


iscsictl -A -p 192.168.56.10 -t iqn. 


2016-01 .local.sss.private:targetO 
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But this was only a half of the job. The other part is to create cache synchronization mechanisms 
to maintain cache mirroring between the controllers. We also use gmirror for this task and 


ggate (ggatel) to hide these mirrors under the inter-layer devices (see the remark on the page 
above): 


gmirror load 
omirror load 


gmirror label ctrl _b zil /dev/ 


daQ /dev/md2 gmirror label ctrl _a_zil /dev/ 


da0O /dev/md0 
gmirror label ctrl_b arc /dev/ 


dal /dev/md3 gmirror label ctrl _a_arc /dev/ 


dal /dev/mdl 


ggatel create -t 1 -u 2 /dev/ 


ee eee ggatel create -t 1 -u 0 /dev/ 


Marror/ccrl, 2. Zaid. 


ggatel create -t 1 -u 3 /dev/ 


mirror/ctrl b arc ggatel create -t 1 -u 1 /dev/ 


mirror/ctrl a arc 











We have just invented the structure of the mirrored devices and their unified inter-layer interface. 
Now we can try to populate it with the ZFS cache. 


ZFS 


The Pool configuration is quite simple and well known from the previous article. 


We create the ctril-a_m0O and the ctrl-b_m0O pools on the shareable drives (ada1/ada2 and ada3/ 
ada4). Then we allocate volumes on these pools: 





zpool create -m none ctrl-a_m0 / 


zpool create -m none ctrl-b m0 / 
dev/adal /dev/ada2 — 


dev/ada3 /dev/ada4 










Zits create =v 120M ctrl-a.m0/Vv0 zfs create -V 120M ctrl-b m0/v0 
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Now we can configure ZFS cache. We will use RAM-drives (hided by the ggate pseudo-devices) 


for this purpose: 


# ZI, 


zpool add -f ctrl-a_m0 log /dev/ 
ggate0 


zis set sync=always ctrl=-a_m0 


# ARC/L2ZARC 


zpool add ctrl-a_ m0 cache /dev/ 
ggatel 


zfs set primarycache=none ctrl- 


a_m0 


zfs set secondarycache=all ctrl- 


a_m0 





+ ZI 


Zpool. add. =f ctrl-b m0 log /dev/ 
ggate2 


Zi8 set. sync=always. ctril-b m0 


# ARC/L2ARC 


zpool add ctrl-b m0 cache /dev/ 
ggate3 


zfs set primarycache=none ctrl- 
b m0 


zfs set secondarycache=all ctrl- 
b_ m0 








At the final step of ZFS configuration, we will import pools from opposite controllers and set spe- 
cial flags to ignore all errors on them. Yes, this is dangerous but we must do it to be able to fail to 


the alive controller in case of failure: 






ZD00l import. =N .cerl=b m0 









zpool set failmode=continue ctrl- 


a_m0 


zpool set failmode=continue ctrl- 
b m0 






Zpool import -N ctrl-a m0 





zpool set failmode=continue ctrl- 
a_m0 


zpool set failmode=continue ctrl- 
b m0 
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Note, we set -N option to disable auto-mounting ZFS function. 


At this point, ZFS pools of both controllers with their mirrored in-memory caches should work, 
therefore, we can setup the volume failover arbitrator mechanism. 


The Arbitrator 


The volume arbitrator, which we use to switch volumes between the controllers, is pretty much 
the same as in both previous tests. 


Therefore, just check and edit /etc/ctld to add appropriate volumes (we use LUN 10 on both con- 
trollers for them): 


Dortal=Group pou 4 POrtal=-Gqroup pg. 4{ 
di sCcovery=aulh—qroup. nmo= discovery-auth-group no- 
authentication authentication 
listen 192.1¢66.56.10 listen 192.168.56.11 
} } 
target ign. target ign. 


2016-01.local.sss.private:target0O 2016-01.local.sss.private:target0O 
{ { 








AuLh=-Group no= aAucth=Group no= 
authentication authentication 

portal-group pg0 portal-group pgO 

# cache # cache 

lun 0 { lun 2 4 

path /dev/md0 path /dev/md2 
} } 
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lun 1 { lun 3 { 
path /dev/mdl path /dev/md3 
} } 
# data volumes # data volumes 
lun 10 { lun 1G 4 
path /dev/zvol/ path /dev/zvol/ 
ctrl-a_m0/v0 ctrl-b m0/v0 
} } 
} } 


Then, force ctld to re-read its configuration file and reinitialize iSCSI connections to discover the 
data-volumes. Finally, assemble the arbitrator mechanism by creating active-passive multipath 
pseudo-device: 


killall -HUP ctld kiliall =hUP chld 


# reinitialize iSCSI backend a ees 
+ FelInitialize aSsCel backend 
session 

session 


IAca Vcr =M aa 2 =o PO2. toe. 56.11 


tseceiCEl —Meaa do-p 192.166 .56.10 
=e. Les 


=E. a ‘ 
2016-01.local.sss.private:target0O ae 
2016-01.local.sss.private:target0O 


# Create the arbitrator 
# Create the arbitrator 


gmultipath create CTRL _B BACK / 


1ti El te CTRL A BACK 
dev/da2 /dev/zvol/ctrl-b_ m0/v0 gmuttipa Creare AL 7 


dev/da2 /dev/zvol/ctrl-a_m0/v0 
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Note the -M option of the iscsict1 command that we use to modify the previously established 
session. 


Front-End 


In the final part of the storage-side preparations, we should configure the iSCSI front-end to ac- 
cept the client connections. Therefore, we add “public” related sections to the /etc/ctl.conf: 


Pportal=group pq0 { 

portal—group pg? -{ 

di SCcovery=aulh= Group: io 
discovery-auth-group no- authentication 

authentication 

listen 192.168.56.11 

lasten 192.168.56.10 


portal-croup- pal + 

portal-group pgl { 

di SCcovery=aulh=Group no= 
discovery-auth-group no- authentication 

authentication 

listen 192.168.55.11 

listen 192.1638 s-55<10 


} 


target iqn. 
2016-01.local.sss.private:target0 target ign. 
{ 2016-01.local.sss.private:target0O 
{ 

auth-group no- 
authentication auth-group no- 
authentication 
portal-group pg0 
pOrtal=group p00 
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# cache # cache 
lun O { ian: 2 4 
path /dev/md0 path /dev/md2 
} } 
Jaan 2... 4 ah. 4° 4 
path /dev/mdl path /dev/md3 
} } 
# data volumes # data volumes 
aan. 0" 4 lun 10 { 
path /dev/zvol/ path /dev/zvol/ 
ctrl-a_m0/v0 ctrl-b_ m0/v0 
} } 
} } 
target ign. target ign. 
20 LO-O1L. local.ess.public*tarcec). { |201l6-01.local.sss.publicstargerd { 
aurLhn=—orouy no= Aun =Grouy To= 
authentication authentication 
portal-group pgl portal-gqroup pal 
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lun 0 { lum 0 4 
path /dev/zvol/ path /dev/zvol/ 
ctrl-a_m0/v0 ctrl-b_m0/v0 
} } 
lun 1 4 Jam 4 
path /dev/ path /dev/ 
multipath/CTRL B BACK multipath/CTRL A BACK 
} } 
} } 





Finally, we must tell ctld to update its LUN information once more. Therefore, run on both control- 
lers: 





# killall -HUP ctld 


After fresh additions and changes to the concept of the BeaST storage, the structure of our model 
became quite complex. Therefore, see the full architecture layout below to get the whole picture 
with a bird's-eye view. 
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Figure 1. The BeaST storage architecture with ZFS and in-memory cache overview. 
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The client-side 


The client configuration has not changed since the first test. So take the previous client virtual ma- 
chine as-is and check FreeBSD basic configuration in /etc/rc.conf: 


hostname="clnt-1" 


akisoroyoGumep mer uabigionm ANCA svc Moro eA Gh ielene isle < valor on aaron Ohy. ois Jeillomeaibe: gatein = 
work 


icyou gel —iqhel oll Tg oro 


# Set dumpdev to "AUTO" to enable crash dumps, "NO" to disable 


dumpdev="AUTO" 


# VirtualBox guest additions 
idelon.deibicishe: Yoietsloll oy Gays) ie 


WASo Sela enalictey Teimelod Sy idlihisy 4 


pease sal 


asvefsal(e) Kouls leo aS. ig dligalivaleslccene,s 


The /etc/sysctl.conf file is also the same: 


I oneg ght s even mages le tejateyc elis/exejaie\(xeealieyal—a 


BSD 





Then, run all the very well known commands to bring iSCSI connections online, form multipath 
pseudo-devices, stripe the volume, create and mount the new file system: 


Now you can start the common tasks to test the BeaST solution for its functionality and reliability. 
See our previous works “FreeBSD based dual-controller storage system concept” and “Adding 
ZFS to the FreeBSD dual-controller storage concept” for the details of these processes. 


In this paper, we have offered the mechanism of in-memory cache implementation for the storage 
system based only on the standard FreeBSD and ZFS features. 


This was the most important and interesting part of this paper indeed. So the most common and 
boring tasks (like FreeBSD installations, CLI outputs along with functional tests, etc.) were cut off 
from this paper by the Occam's razor principle, as they completely repeat the previous two arti- 
cles. 


We have done much, but there are still more things to do. We must test the BeaST on the real 
hardware, write installation and clustering scripts to automate typical tasks (like volume creation, 
migration, recovery, etc) on both controllers. Also, we should check the new and interesting 
changes in CAM target layer of FreeBSD 10.3. 
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Finally, we must warn you that the BeaST concept is the study of storage systems technology. All 
the ideas, algorithms and solutions are at concept, development and testing stages. Do not imple- 
ment the BeaST in production as nobody can guarantee that you will not lose data. Be careful 


and always do backups! 











About the Author: 


My name is Mikhail E. Zakharov and | 
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’ve just installed MySQL server 5.7 for FreeBSD 10.2 using 
pkg package manager. It seems the problem occurred when 
| went to start MySQL for the first time. This time I’m using 
the real VPS server and not VirtualBox. 


The error is: 


Here is what happened to my VPS server. 
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The default installation from pkg package manager will be in /usr/local/ and the configura- 
tion file will be in 


Some people will try to fix the problem by deleting some line from /usr/local/etc/rc.d/mysql-server. 
| don’t know the result from deleting some line but | had to try using the following method. 


1. Fix ownership from my.cnf file in 


Look at the following figure from my VPS server. 





Notice that the ownership of my.cnf is user: root and group: wheel. You have to change the owner- 
ship to user: mysql and group: mysql. using the following command: 
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2. Next after changing the ownership you can initialize MySQL for the first time before we can use 
it for production use. The new version of MySQL server uses command mysqld - initialize for 
initialization instead of mysql install db. If you called only mysqld -initialize the com- 
mand cannot be recognized by the shell because we don't include the file path in the PATH vari- 
able. So you have to use an absolute path like this: 


3. The following figure will help you understand how the process works. 





You may notice that from the red square there is a temporary password for managing the MySQL 
server. But you have to understand that the MySQL server still doesn’t start. Now the next step is 
using the following command for starting the MySQL server: 
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The following figure below will be the detail information. 





Do the ps -ax | grep mysq] for searching the MySQL server daemon and you can see that MySQL 
server is up and running. 


3. Next write the temporary password from the generator before to /root/.mysql_ secret and then 
do the following command for installing the tables and fix the secure installation. 


There will be questions you have to answer. The following figure will help you answer questions 
from MySQL installation. 
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Expert Says... 


Let Us Start at the Very Beginning 
with Open Source 


by E.G. Nadhan 


The Red Hat coloring booth at SX Create during the South 

by Southwest (SXSW) conference and festival helped to tell 

the open source story of collaboration in a way that was sim- 
ple enough for children to understand. It was a booth where 

new ideas manifested themselves in the form of colorful 

shapes and forms on the walls. Thousands of attendees con- 
tributed, bringing the 12 foot tall space to life. Ideas were ex- 
panded on by other children, modified or cloned by some to 

give birth to other related ideas. Continuous, enthusiastic 

collaboration at its peak in an environment ripe for open 

communication -- and thus, innovation. Sound familiar? 


This is exactly what the Open Source community is all about. The children get it. That is what 
they do naturally in an uninhibited manner without any artificial boundaries. And thus, the princi- 
ples of Open Source can be easily injected into the minds of the school-going children. Let us join 
hands to start at the very beginning with Open Source. 


“What I value is the Open Hearts and Open minds”, said Carol Spindel, Lecturer, Dept. of Eng- 
lish at the University of Illinois at Uroana Champaign. It was the graduation reception for the Cam- 
pus Honors Program for the Class of 2016 and Spindel received the Teaching Award for this year. 
In the brief speech that she gave after receiving this award, Spindel characterized the spirit of the 
students she works with in her simple but powerful assertion. An Open mind is exactly what Intel 
VP, Imad Sousou called out in his recent keynote at the Open Stack Summit in Austin. At the 
same event, | had close encounters with two individuals with fascinating backgrounds and per- 
spectives who bring diversity -- and therefore, an open mindset -- to the world of open source. 
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Expert Says... 





Gina Likins is part of the University Outreach team at Red Hat, which exists to help universities 
incorporate open source into their curriculum. “In an ideal world, students would graduate from 
college ready to contribute to an open source community in some significant way” -- reads the ab- 
stract for her session at the Open Source Convention in Austin, TX. Likins presented this session 
along with Professor Heidi Ellis (Western New England University) and Professor Gregory Hislop 
(Drexel University). 


With the Open mindset being a defining characteristic of the student community, the principles of 
Open Source must be an integral part of the academic curriculum across educational institutions. 
Here are 6 colleges proactively turning out open source talent. Gina, Heidi, and Gregory also de- 
scribe the inventory of learning paths they are building to educate students about open source 
and define a pathway to becoming contributors. This pathway includes a sequence of activities 
that prepare students for successful humanitarian FOSS participation. The activities cover a vari- 
ety of aspects of open source participation, including technical, process, and cultural knowledge. 


In addition to champions like Gina, Heidi and Gregory, we also need advocates like Charlie Re- 
isinger embedded within the schooling system to drive the adoption of Open Source principles. At 
Penn Manor School District, Reisinger’s team implemented a unique 1:1 school laptop program 
using free and open source software exclusively. Reisinger loves to speak on open source learn- 
ing technologies, and is currently writing a book about their open schoolhouse adventures”. Side- 
bar: | really enjoyed being part of the #OpenOrg Twitter Chat with Reisinger. Open Minds: Can 
you take a hint? 


Injecting Open Source into academia has a greater likelihood for sparking innovation. Take, for 
instance, the ninth annual Imagine RIT, the Rochester Institute of Technology's annual innovation 
and creativity festival, held this year where about 30,000 people arrive on campus to view stu- 
dent, faculty, and staff demonstrations. Visitors experience everything RIT has to offer through in- 
teractive presentations, hands-on demonstrations, exhibitions, and research projects set up 
throughout campus. One of the tables at this year's festival was the FOSS@MAGIC table, where 
students demonstrated their open source software and hardware projects. 


There you have it. Forward thinking academic institutions are taking the right steps to fuel the 
spirit of open source across their student community. And why shouldn't they? There is a global 
community of students with the right mindset to embrace, adopt, and innovate the open source 
way. 


The student world is primed and ready for corporate organizations and academic institutions to 
join forces to inject a healthy dose of Open Source. 


Let us start at the very beginning by taking the baby step to introduce Open Source across the 


schooling system. 
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With over 25 years of experience in the IT industry sell- 
ing, delivering and managing enterprise solutions for 
global enterprises, E.G.Nadhan is the Chief Technol- 
ogy Strategist at Red Hat (Central Region) working 
with the executive leadership of enterprises to innova- 
tively drive Cloud Transformation with Open Source 
technologies and DevOps. Nadhan also provides 
thought leadership on various concepts including Big 
Data, Analytics and the Internet of Things (loT). Nad- 
han has published 500+ blog posts over four years in 
multiple Blogs including HP, The Open Group, Enter- 
prise ClO Forum, 1CloudRoad and Intel !TCenter 
while interacting with analysts from Forbes, Gartner 
and IDC. Prior to joining Red Hat, Nadhan was an HP 
Thought Leader and the Global Leader for the Culture 
of Innovation where he worked with the executive lead- 
ership of strategic accounts to realize in- 
novative ideas that address key business 
challenges for customers. As the Co- 
Chair for multiple projects within the 
Cloud Work Group, Nadhan led the publi- 
cation of the first Cloud Computing Techni- 
cal Standard and leads the Cloud Govern- 
ance project within The Open Group. He 
is a recognized author/speaker at industry 
conferences and has co-authored multi- 
ple books. Follow him on Twitter 

. Connect with him on LinkedIn 
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Docker Cleanup 


by Nicolas De Loof 


Most Docker newcomers are disappointed when, at some 
time, after various experiments with Docker, they hit a no 
space left on device issue. Docker does indeed store all con- 
tainers, images and volumes in /var/1ib/docker, which 


can quickly grow to gigabytes. 


One can find tons of blog posts about how to cleanup this folder. Here I'd like to give my own tips, 
and to explain them in detail, so you don't run random commands found by Googling the Internet. 


/var/1ib/docker is used to store: 

¢ Docker images 

¢ Docker container descriptors 

¢ Docker networks descriptors 

¢ Docker volumes 

¢ Containers’ layers (depends on the storage driver you used. Typically AUFS) 
Terminated containers 


| use Docker a lot to experiment with Uunix commands, so | run docker run --it ubuntu bash. 
When you run a container without the --rm option, the container still exists on disk after the com- 
mand completes. So doing this will create tons of containers, and layers for things | modified on 
the container's filesystem. For this reason, | created an alias docker-run to ensure | don't forget 
this option. But this option can't be used with -d (run in background) so | can't use it when | want 
to run some backend service for testing purpose. 
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At the end of the day, | have tons of stopped containers that | don't use anymore, and will just con- 
sume disk space in /var/lib/docker. So | use this command to run some cleanup: 


| docker rm -v $(docker ps --filter status=exited -q) fa 


This command lists all exited containers (thanks to status filter), and only dumps their ID (-q). This 
ID list is used by the remove command to cleanup containers, including volumes they were using. 


Unused volumes 


Volumes? Yes, when you run a container which has been built with a VOLUME command in it's 
Dockerfile, Docker implicitly creates a volume on the disk, and may copy data from the container 
image in this volume. This can result in significant disk consumption. Removing a container with 
-v option forces Docker to remove such volumes. This doesn't apply to bound mount volumes, 
only to volumes created by Docker daemon. 


If you already removed your containers without using -v, volumes remain orphaned in /var/lib/ 


| docker volume rm $(docker volume ls -q -f 'dangling=tri 


docker. You can remove them as well using: 
The "dangling" filter selects volumes that aren't referenced by any container. 


Note: one can find many scripts to do comparable cleanup by directly making changes to /var/lib/ 
docker. Those scripts were written before docker volume command was introduced in 1.9. Don't 
use them. Directly hacking your Docker daemon storage isn't a good idea when you have a clean 
API for this purpose. 


Unused images 


You can also use a comparable dangling filter with images. This one will detect image layers that 
| docker rmi $(sudo docker images -f "dangling=true" -q) @ 


are not referenced by a tag, which in many cases is the result of running Docker builds. 
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What about obsolete / unused images? 


You can find some scripts to detect images that aren't used by a container on your system. In a 
production environment, this can make sense, but for my workstation, this would remove mostly 
all Docker images, as most containers | run don't keep running all day long. 


Docker doesn't track image use by containers, this issue is tracking attempt to change this, so 
writing a garbage collector would be simpler. So far I'm using 
https://github.com/ndeloot/docker-gc to collect image usage based on Docker events. Not perfect, 
but does the job. 





About the Author: 


, Nicolas De Loof - Docker Captain, Hacker at CloudBees and community 


Source of the article - Nicolas’ blog: 


| http://blog.loof.fr/2016/05/docker-cleanup.htmi#links 
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My Top 9 Favorite Python Deep 
Learning Libraries 


by Adrian Rosebrock 


So you’re interested in deep learning and Convolutional Neu- 
ral Networks. But where do you start? Which library do you 
use? There are just so many! 


This list is by no means exhaustive, it’s simply a list of librar- 
ies that I’ve used in my computer vision career and found 
particularly useful at one time or another. 


Some of these libraries | use more than others — specifically, Keras, mxnet, and sklearn-theano. 


Others, | use indirectly, such as Theano and TensorFlow (which libraries like Keras, deepy, and 
Blocks build upon). 


And even others, | use only for very specific tasks (such as nolearn and their Deep Belief Net- 
work implementation). 


The goal of this blog post is to introduce you to these libraries. | encourage you to read up on 
each of them individually to determine which one will work best for you in your particular situation. 


My Top 9 Favorite Python Deep Learning Libraries 


Again, | want to reiterate that this list is by no means exhaustive. Furthermore, since | am a com- 
puter vision researcher and actively work in the field, many of these libraries have a strong focus 
on Convolutional Neural Networks (CNNs). 


I've organized this list of deep learning libraries into three parts. 


The first part details popular libraries that you may already be familiar with. For each of these li- 
braries, | provide a very general, high-level overview. | then detail some of my likes and dislikes 


about each library, along with a few appropriate use cases. 
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The second part dives into my personal favorite deep learning libraries that | use heavily on a 
regular basis (HINT: Keras, mxnet, and sklearn-theano). 


Finally, | provide a “bonus” section for libraries that | have (1) not used in a long time, but still 
think you may find useful or (2) libraries that | haven't tried yet, but look interesting. 


Let’s go ahead and dive in! 
For starters: 
1. Caffe 


It’s pretty much impossible to mention “deep learning libraries” without bringing up Caffe. In fact, 
since you're on this page right now reading up on deep learning libraries, I’m willing to bet that 
you've already heard of Caffe. 


So, what is Caffe exactly? 


Caffe is a deep learning framework developed by the Berkeley Vision and Learning Cen- 
ter(BVLC). It’s modular. Extremely fast. And it’s used by academics and industry in start-of-the-art 
applications. 


In fact, if you were to go through the most recent deep learning publications (that also provide 
source code), you'll more than likely find Caffe models on their associated GitHub repositories. 


While Caffe itself isn’t a Python library, it does provide bindings into the Python programming lan- 
guage. We typically use these bindings when actually deploying our network in the wild. 


The reason I’ve included Caffe in this list is because it’s used nearly everywhere. You define your 
model architecture and solver methods in a plaintext, JSON-like file called .prototxt configuration 
files. The Caffe binaries take these .prototxt files and train your network. After Caffe is done train- 
ing, you can take your network and classify new images via Caffe binaries, or better yet, through 
the Python or MATLAB APIs. 


While | love Caffe for its performance (it can process 60 million images per day on a K40 GPU), | 
don't like it as much as Keras or mxnet. 


The main reason is that constructing an architecture inside the .prototxt files can become quite 
tedious and tiresome. And more to the point, tuning hyperparameters with Caffe can not be (eas- 
ily) done programmatically! Because of these two reasons, | tend to lean towards libraries that al- 
low me to implement the end-to-end network (including cross-validation and hyperparameter tun- 
ing) ina Python-based API. 
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2. Theano 


Let me start by saying that Theano is beautiful. Without Theano, we wouldn't have anywhere 
near the amount of deep learning libraries (specifically in Python) that we do today. In the same 
way that without NumPy, we couldn’t have SciPy, scikit-learn, and scikit-image, the same can be 
said about Theano and higher-level abstractions of deep learning. 


At the very core, Theano is a Python library used to define, optimize, and evaluate mathematical 
expressions involving multi-dimensional arrays. Theano accomplishes this via tight integration 
with NumPy and transparent use of the GPU. 


While you can build deep learning networks in Theano, | tend to think of Theano as the building 
blocks for neural networks, in the same way that NumPy serves as the building blocks for scien- 
tific computing. In fact, most of the libraries | mention in this blog post wrap around Theano to 
make it more convenient and accessible. 


Don't get me wrong, | love Theano — | just don’t like writing code in Theano. 


While not a perfect comparison, building a Convolutional Neural Network in Theano is like writing 
a custom Support Vector Machine (SVM) in native Python with only a sprinkle of NumPy. 


Can you do it? 

Sure, absolutely. 

Is it worth your time and effort? 

Eh, maybe. It depends on how low-level you want to go/your application requires. 


Personally, I’d rather use a library like Keras that wraps Theano into a more user-friendly API, in 
the same way that scikit-learn makes it easier to work with machine learning algorithms. 


3. TensorFlow 


Similar to Theano, TensorFlow is an open source library for numerical computation using data 
flow graphs (which is all that a Neural Network really is). Originally developed by the researchers 
on the Google Brain Team within Google’s Machine Intelligence research organization, the library 
has since been open sourced and made available to the general public. 


A primary benefit of TensorFlow (as compared to Theano) is distributed computing, particularly 
among multiple-GPUs (although this is something Theano is working on). 
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Other than swapping out the Keras backend to use TensorFlow (rather than Theano), | don’t have 
much experience with the TensorFlow library. Over the next few months, | expect this to change, 
however. 


4. Lasagne 


Lasagne is a lightweight library used to construct and train networks in Theano. The key term 
here is lightweight — it is not meant to be a heavy wrapper around Theano like Keras is. While 
this leads to your code being more verbose, it does free you from any restraints, while still giving 
you modular building blocks based on Theano. 


Simply put: Lasagne functions as a happy medium between the low-level programming of 
Theano and the higher-level abstractions of Keras. 


My Go-To’s: 
5. Keras 


If | had to pick a favorite deep learning Python library, it would be hard for me to pick between 
Keras and mxnet — but in the end, | think Keras might win out. 


Really, | can’t say enough good things about Keras. 


Keras is a minimalist, modular neural network library that can use either Theano or TensorFlow 
as a backend. The primary motivation behind Keras is that you should be able to experiment fast 
and go from idea to result as quickly as possible. 


Architecting networks in Keras feels easy and natural. It includes some of the latest state-of-the- 
art algorithms for optimizers (Adam, RMSProp), normalization (BatchNorm), and activation layers 
(PReLU, ELU, LeakyReLU). 


Keras also places a heavy focus on Convolutional Neural Networks, something very near to my 
heart. Whether this was done intentionally or unintentionally, | think this is extremely valuable 
from a computer vision perspective. 


More to the point, you can easily construct both sequence-based networks (where the inputs flow 
linearly through the network) and graph-based networks (where inputs can “skip” certain layers, 
only to be concatenated later). This makes implementing more complex network architectures 
such as GoogLeNet and SqueezeNet much easier. 


My only problem with Keras is that it does not support multi-GPU environments for training a net- 
work in parallel. This may or may not be a deal breaker for you. 
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If | want to train a network as fast as possible, then I'll likely use mxnet. But if I’m tuning hyper- 
parameters, I’m likely to setup four independent experiments with Keras (running on each of my 
Titan X GPUs) and evaluate the results. 


6. mxnet 


My second favorite deep learning Python library (again, with a focus on training image classifica- 
tion networks), would undoubtedly be mxnet. While it can take a bit more code to standup a net- 
work in mxnet, what it does give you is an incredible number of language bindings (C++, Python, 
R, JavaScript, etc.) 


The mxnet library really shines for distributed computing, allowing you to train your network 
across multiple CPU/GPU machines, and even in AWS, Azure, and YARN clusters. 


Again, it takes a little more code to get an experiment up and running in mxnet (as compared to 
Keras), but if you’re looking to distribute training across multiple GPUs or systems, | would use 
mxnet. 


7. sklearn-theano 


There are times where you don’t need to train a Convolutional Neural Network end-to-end. In- 
stead, you need to treat the CNN as a feature extractor. This is especially useful in situations 
where you don’t have enough data to train a full CNN from scratch. Instead, just pass your input 
images through a popular pre-trained architecture such as OverFeat, AlexNet, VGGNet, or Goo- 
gLeNet, and extract features from the FC layers (or whichever layer you decide to use). 


In short, this is exactly what sklearn-theano allows you to do. You can’t train a model from scratch 
with it — but it’s fantastic for treating networks as feature extractors. | tend to use this library as 
my first stop when evaluating whether a particular problem is suitable for deep learning or not. 


8. nolearn 


I’ve used nolearn a few times already on the PylmageSearch blog, mainly when performing some 
initial GPU experiments on my MacBook Pro and performing deep learning on an Amazon EC2 
GPU instance. 


While Keras wraps Theano and TensorFlow into a more user-friendly API, nolearn does the same 
— only for Lasagne. Furthermore, all code in nolearn is compatible with scikit-learn, a huge bo- 
nus in my book. 


| personally don’t use nolearn for Convolutional Neural Networks (CNNs), although you certainly 
could (I prefer Keras and mxnet for CNNs) — | mainly use nolearn for its implementation of Deep 


Belief Networks (DBNs). 
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9. DIGITS 


Alright, you got me 


DIGITS isn’t a true deep learning library (although it is written in Python). DIGITS (Deep Learning 
GPU Training System) is actually a web application used for training deep learning models in 
Caffe (although | suppose you could hack the source code to work with a backend other than 
Caffe, but that sounds like a nightmare). 


If you've ever worked with Caffe before, then you know it can be quite tedious to define your .pro- 
totxt files, generate your image dataset, run your network, and babysit your network training all 
via your terminal. DIGITS aims to fix this by allowing you to do (most) of these tasks in your 
browser. 


Furthermore, the user interface is excellent, providing you with valuable statistics and graphs as 
your model trains. | also like that you can easily visualize activation layers of the network for vari- 
ous inputs. Finally, if you have a specific image that you would like to test, you can either upload 
the image to your DIGITS server or enter the URL of the image and your Caffe model will auto- 
matically classify the image and display the result in your browser. Pretty neat! 


BONUS: 
10. Blocks 


Ill be honest, I’ve never used Blocks before, although | do want to give it a try (hence why I’m in- 
cluding it in this list). Like many of the other libraries in this list, Blocks builds on top of Theano, 
exposing a much more user friendly API. 


11. deepy 
If you were to guess which library deepy wraps around, what would your guess be? 
That's right, it’s Theano. 


| remember using deepy awhile ago (during one if its first initial commits), but | haven’t touched it 
in a good 6-8 months. | plan on giving it another try in future blog posts. 


12. pylearn2 


| feel compelled to include pylearn2 in this list for historical reasons, even though | don’t actively 
use it anymore. Pylearn2 is more than a general machine learning library (similar to scikit-learn in 
that it respect), but also includes implementations of deep learning algorithms. 
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The biggest concern | have with pylearn2 is that (as of this writing), it does not have an ac- 
tive developer. Because of this, I’m hesitant to recommend pylearn2 over more maintained 
and active libraries such as Keras and mxnet. 


13. Deeplearning4j 


This is supposed to be a Python-based list, but | thought | would include Deeplearning4j in here, 
mainly out of the immense respect | have for what they are doing — building an open source, dis- 
tributed deep learning library for the JVM. 


If you work in enterprise, you likely have a basement full of servers you use for Hadoop and Ma- 
pReduce. Maybe you're still using these machines. Maybe you’re not. 


But what if you could use these same machines to apply deep learning? 


It turns out you can — 
INPUT CONV POOL CONV POOL FC QUTPUT ~ you just need Deeplearn- 
—— f = i ing4j. 


Take a deep dive into 

i r} Deep Learning and 

ae —_—_ ~~ Convolutional Neural 
Networks 
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Figure 1: Learn how to util- 
ize Deep Learning and Con- 
volutional Neural Networks 
to classify the contents of 
images inside the Pylmage- 


Boat: & % Search Gurus course. 
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Curious about deep learning? 


I’m here to help. 


Inside the PylmageSearch Gurus course, I’ve created 21 lessons covering 256 pages of tutorials 
on Neural Networks, Deep Belief networks, and Convolutional Neural Networks, allowing you to 
get up to speed quickly and easily. 


To learn more about the PylmageSearch Gurus course (and grab 10 FREE sample lessons). 
Summary 


In this article, | reviewed some of my favorite libraries for deep learning and Convolutional Neural 
Networks. This list was by no means exhaustive and was certainly biased towards deep learning 
libraries that focus on computer vision and Convolutional Neural Networks. 


All that said, | do think this is a great list to utilize if you’re just getting started in the deep learning 
field and looking for a library to try out. 


In my personal opinion, | find it hard to beat Keras and mxnet. The Keras library sits on top of 
computational powerhouses such as Theano and TensorFlow, allowing you to construct deep 
learning architectures in remarkably few lines of Python code. 


And while it may take a bit more code to construct and train a network with mxnet, you gain the 
ability to distribute training across multiple GPUs easily and efficiently. If you’re in a multi-GPU 
system/environment and want to leverage this environment to its full capacity, then definitely give 
mxnet a try. 
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~ My name is Adrian Rosebrock. I’ve been working in the startup world professionally for the past 
8 years, while at the same time earning my PhD in Computer Science with a focus in Computer 
; Vision and Machine Learning at the University of Maryland, Baltimore County. 


i My Twitter: @PylmageSearch 

My Facebook: PylmageSearch 

| My Google Plus Profile: +Adrian Rosebrock 

| My LinkedIn: Adrian Rosebrock 

You can contact me at adrian (at) pyimagesearch.com. 
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HOW IMPORTANT IS YOUR DATA? 


Years of family photos. Your entire music 
and movie collection. Office documents 
you've put hours of work into. Backups for 
every computer you own. We ask again, how 
important is your data? 


NOW IMAGINE LOSING IT ALL 


Losing one bit - that’s all it takes. One single bit, and 
your file is gone. 





The worst part? You won't know until you 
absolutely need that file again. Example of one-bit corruption 


THE SOLUTION 


The FreeNAS Mini has emerged as the clear choice to The Mini boasts these state-of-the- 
save your digital life. No other NAS in its class offers art features: 
ECC (error correcting code) memory and ZFS bitrot 


protection to ensure data always reaches disk * 8-core 2.4GHZ Intel® Atom™ processor 
: : . + Up to 16TB of storage capacity 
without corruption and never degrades over time. 


+ 16GB of ECC memory (with the option to upgrade 
to 32GB) 


No other NAS combines the inherent data integrity + 2x 1 Gigabit network controllers 
+ Remote management port (IPM) 


- Tool-less design; hot swappable drive trays 
encryption. No other NAS provides comparable power - FreeNAS installed and configured 


and flexibility. The FreeNAS Mini is, hands-down, the 
best home and small office storage appliance you can 
buy on the market. When it comes to saving your 
important data, there simply is no other solution. 


and security of the ZFS filesystem with fast on-disk 
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CERTIFIED 
STORAGE 


With over six million downloads, 
FreeNAS is undisputedly the most 
popular storage operating system 
in the world. 


Sure, you could build your own FreeNAS system: 
research every hardware option, order all the 

parts, wait for everything to ship and arrive, vent at 
customer service because it hasnt, and finally build it 
yourself while hoping everything fits - only to install 
the software and discover that the system you spent 
days agonizing over isn’t even compatible. Or... 


MAKE IT EASY ON YOURSELF 


As the sponsors and lead developers of the FreeNAS 
project, iXsystems has combined over 20 years of 
hardware experience with our FreeNAS expertise to 
bring you FreeNAS Certified Storage. We make it 
easy to enjoy all the benefits of FreeNAS without 
the headache of building, setting up, configuring, 
and supporting it yourself. As one of the leaders in 
the storage industry, you know that you're getting the 
best combination of hardware designed for optimal 
performance with FreeNAS. 


Every FreeNAS server we ship is... 


» Custom built and optimized for your use case 

» Installed, configured, tested, and guaranteed to work out 
of the box 

» Supported by the Silicon Valley team that designed and 
built it 

» Backed by a 3 years parts and labor limited warranty 


http://www.iXsystems.com/storage/freenas-certified-storage/ 





As one of the leaders in the storage industry, you 
know that you're getting the best combination 

of hardware designed for optimal performance 

with FreeNAS. Contact us today for a FREE Risk 
Elimination Consultation with one of our FreeNAS 
experts. Remember, every purchase directly supports 
the FreeNAS project so we can continue adding 
features and improvements to the software for years 
to come. And really - why would you buy a FreeNAS 
server from anyone else? 
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FreeNAS 1U 

+ Intel* Xeon* Processor E3-1200v2 Family 

+ Up to 16TB of storage capacity 

+ 16GB ECC memory (upgradable to 32GB) 

+ 2x 10/100/1000 Gigabit Ethernet controllers 
+ Redundant power supply 


FreeNAS 2U 
+ 2xIntel* Xeon* Processors E5-2600v2 Family 
+ Up to 48TB of storage capacity 
+ 32GB ECC memory (upgradable to 128GB) 
+ 4x 1GbE Network interface (Onboard) - 
(Upgradable to 2 x 10 Gigabit Interface} 
+ Redundant Power Supply 
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FreeNAS Getting Started Guide: 
Part 2, The Initial Configuration Wiz- 
ard 


by Mark VonFange 


This article series is intended to serve as an introductory 
guide to assist FreeNAS users in planning, installation, con- 
figuration and administration for their FreeNAS storage sys- 
tems. This month’s article will cover getting your FreeNAS 
set up using the Initial Configuration Wizard. 


The FreeNAS Initial Configuration Wizard 


For new users, FreeNAS offers a Wizard system to get your basic functions up and running. Once 
you log in for the first time, a configuration wizard will pop up to guide you through the initial con- 
figuration of your FreeNAS server appliance. If the wizard does not automatically pop up, you can 
simply click on the top hat icon in the top menu bar. You may quit the Wizard to perform manual 
configuration any time, but settings are only saved at the end. 


You can find full documentation on the Initial Configuration Wizard at 
https://doc.freenas.org/9.3/freenas_quick.htmi#initial-configuration-wizard and the FreeNAS team 
also has put together a video on the Wizard that can be found on their YouTube Channel or on the 
FreeNAS.org Get Help Video Walkthroughs page at http:/,www.freenas.org/about/videos/. 


First, choose your time zone, whether you want to use a language other than English, and 
whether you will be using a keyboard layout other than the standard US one. Click “Next” to pro- 
ceed to the next step. These steps may take some time to complete, so there may be a wait be- 
fore the next screen in the wizard loads. 


When getting started, the wizard will first ask you to set up your language, keyboard and timezone 
settings (Fig. 1). 
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Wizard 


Language: English 


Console Keyboard Map: 


Timezone: CST6CDT 


cs cs 


Figure 1. Wizard set up. 





¢ Volume Configuration 


Next, the Wizard will present some choices for pre-designed volume configurations (Fig. 2). 
Choose a name for your volume and a standard volume configuration if you wish, then click 
“Next” to continue. If you have a custom configuration you prefer to use, you may quit the Wizard 
and perform Volume configuration manually, then click the “Wizard” button to return to the Wizard 
with the Volume Configuration step left out. You must have a volume configured to continue with 
the Wizard. 
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Pool Name: 


Purpose 


e © Automatic - Pick reasonable defaults for available drives 

“ Virtualization (RAID 10: Good Reliability, Better Performance, Minimum Storage) 
2 Backups (RAID Z2: Good Reliability, Medium Performance, Medium Storage) 

° Media (RAID Z1: Medium Reliability, Good Performance, More Storage) 

° Logs (RAID O: No Reliability, Best Performance, Maximum Storage) 


Estimated Total Size: 18.0 GiB 


Figure 2: Volume configuration menu. 





*Note: If you’ve already set up your volume, this menu will not appear during the wizard session. 
¢ Directory Services 


The next menu will allow you to set up basic directory service settings (Fig. 3). If you know your 
Directory Service account information, select the service you use, enter the applicable informa- 
tion, and then click “Next”. If you do not use a directory service, you can click “Next” and proceed 
to the next step. You can find out more information on this portion in the FreeNAS documentation 
at http://doc.freenas.org/9.3/freenas_quick.htmi#initial-configuration-wizard 
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Directory Service: Active Directory 


Domain Name (DNS/Realm-Name): 
Domain Account Name: 


Domain Account Password: 


Figure 3: Directory Services Menu 





e Shares and Permissions 


Next, you will need to set up at least one share in order to store files on your FreeNAS server 
over the network. On this screen (Fig. 4), choose a name for the share. Windows shares are sup- 
ported by all modern Windows versions, Mac OS X, most popular Linux distributions, and 
FreeBSD, so we recommend Windows shares if you're not sure what type to pick. If you want the 
share to be used by anyone on your network without logging in, click the “Allow Guest” button, 
then click “Add”. You may make as many shares as you want, or skip this step if you don’t want to 
add shares at this time. 
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Wizard 


Share name: 


Purpose 
© Windows (CIFS) Allow Guest 
Mac OS X (AFP) Time Machine 
Generic Unix (NFS) 
Block Storage (iSCSI) Size: 


Delete Update 





[Name 











Figure 4: Shares Menu. 
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If you instead want to assign a particular owner of the share, click the “Ownership” button. On the 
Permissions screen (Fig. 5), enter the name of the user you want to own the share and what 
group you want to share it with. If those aren’t pre-existing system users and groups, click “Cre- 
ate User” and “Create Group” to add them to the system. You will be prompted to create a pass- 
word to create a new user. 


Wizard 


User: _ root Create User (7) 


Group: wheel Create Group (4) 


Owner Group Other 
Read 4 7) 4 
Write 4 
Execute ./ 7) 


Mode: 


| Return | Cancel | 


s é s 





Figure 5: Ownership Menu for User and Group Permissions 


You may also choose what permissions are available to the user, group, and other users. After 
you finish changing the user and group permissions, click “Return” to go back to share configura- 
tion. The permissions you selected will apply to the next share you add. You can find out more 
information on setting up permissions on the FreeNAS YouTube Channel or the FreeNAS.org Get 
Help Video Walkthroughs page. 


“Note: If you’ve already set up a share manually from the User Interface, you will not want to en- 
ter anything on this menu due to the potential of data loss. 


BSD 


MAGAZINE 


62 





¢ Miscellaneous Settings 


Next, the Wizard will offer you a number of configuration options (Fig. 6). “Console messages” 
will enable a live feed of the FreeNAS console log in the footer of the FreeNAS Web Interface. 
The rest of the settings concern sending emails to alert you of issues with your FreeNAS Mini, 
such as failed disks or available updates. 


Wizard 


Console messages: eA wb 
Root E-mail: 
From email: root@freenas.local 


Outgoing mail server: smtp.ixsystems.com 


Port to connect to: 


TLS/SSL: 


Use SMTP Authentication: 
Username: 
Password: 


Password confirmation: 


rons) | | 


Figure 6: Miscellaneous Settings Menu 
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Enter the email address into “Root Email’ where you would like FreeNAS to send alerts. You 
must include the hostname or IP address of the SMTP server required by your email account in 
the “Outgoing Mail Server” field. Port 25 is the default for SMTP, but if you use an alternate port, 
enter it in “Port to connect to”. If the mail server supports encrypted SMTP connections, select the 
encryption type in the “TLS/SSL” drop-down menu. If the mail server supports SMTP authentica- 
tion, check the “Use SMTP Authentication” box and enter the username and password used for 
mail server authentication. 


¢ Finish Configuration 


When get to the final Wizard Menu (Fig. 7), if you are sure about all your settings, click “Confirm” 
to have FreeNAS perform the configuration you’ve selected. If you want to check one more time, 
click “Return to Wizard”, which will take you back through the steps of the Wizard with all your se- 
lected configurations already in place. If you would rather discard your changes and proceed to 
the FreeNAS UI, you may click “Exit without saving” at this time. 


Wizard 


You are about to leave the Initial Wizard and perform all pending (saved) actions. 


Are you sure? 


| Confirm | Return to Wizard | | Exit without saving 





Figure 7. Wizard Menu. 
¢ Connect to your share 


Now, it is time to connect to your share. The Wizard will already have started the share service(s) 
you selected, so it is already working. We will use Windows as an example. Open a File Explorer 
window in a Windows computer on the same network as your FreeNAS server (Fig 8). In the ad- 
dress bar at the top, enter \\ followed by the hostname of your FreeNAS server Appliance (‘“fre- 
enas” by default) or the network IP from your console menu. The current hostname also appears 
in the title bar of the browser window where you configure your FreeNAS server. Your share con- 
tents will appear. You may have to enter the username and password you entered earlier as well. 
If prompted, enter the username and password you set up in step 5. 
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&| ~ | freenas 


Home Share View 








¢ & Cut xX m| t New item ~ e- E¢ Open ~ Select all 
WW) Copy path ¢ Ta Easy access * Edit Select none 
Pin to Quick Copy Paste ; Move Copy Delete Rename New Properties 7 
access [4] Paste shortcut toy tor . folder s @ History eH Invert selection 
Clipboard Organize New Open Select 
€ v 4 | @\\freenas ee < 
~» HD-PCTU2 (D:) a I Zz Z 
General Storage media t 
~» HD-PCTU2 (D;) — < < 
 _Serato_ 5 
) _Serato_Backup Transmission 


backup files 
~ ca787ba98F6229d26c9039 





Figure 8: Accessing your FreeNAS server from Windows File Manager 
Conclusion 


Once you’ve completed the Initial Configuration Wizard, you should have the core functions of 
your FreeNAS server up and running for network access. Next month’s installment will go over 
how to accomplish essential tasks from the User Interface itself, like manually setting up users, 
groups, datasets, shares and services, along with basic administration tasks. Please check out 
the Additional Resources provided for more FreeNAS related guidance in the meantime. 


Additional Resources 
Blogs: 


*FreeNAS Best Practices: Part 1 | Part 2 | Part 3 | Part 4 
(http://www. freenas.org/blog/a-complete-guide-to-freenas-hardware-design-part-i-purpose-and- 
best-practices/) 


« FreeNAS: A Worst Practices Guide 
Forums: httos://forums.freenas.org/index.php 
Videos: 

e Setting up your First NAS with FreeNAS 

e FreeNAS 9.x Video Series 


Documentation: hitps://doc.freenas.org/ 
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In this course, we will learn how to use the current ZFS capa- 
bilities to help us build a home file server using FREEBSD 
10.3. 








Course launching date: 04th of July 2016 
What will you learn? 

¢ ZFS administration 

« ZFS concepts and features 

What skills will you gain? 

¢ ZFS administration basics 

What do you need? 

¢ FREEBSD 10.3 with root privileges 

¢ At least 10 GB free space 

What should you know before they join? 


¢ Basic FREEBSD administration knowledge 


GLAS te 


Module 1: FREEBSD and ZFS 

Introduction to ZFS under FREEBSD 

¢ Why ZFS on FREEBSD? 

¢ ZFS features and concepts 

Module 2 title: ZFS Administration 

Module 2 description: Cover the commands and features to administrate ZFS volumes 
¢ Create, destroy, list pools 

¢ Zpools: single, mirrored, raid 

e Understand ZFS properties 

Module 3 title: Putting it all to work: Hosting our files using ZFS 


Module 3 description: With the previous acquired knowledge, create a plan on how to or- 
ganize our files and pools to host our files. 


¢ Set ZFS properties based on the content of the files to host 
¢ ZFS tuning 


¢ Create a File Server using our pools 


https://bsdmag.org/course/using-freebsd-as-a-file-server-with-zfs-2/ 


marta.ziemianowicz@bsdmag.org 
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Do what you love and you will never 
work a day in your life. 


Interview with Aleksandar S. Sokolovski, Co-Founder of Sci 
Bi 


by Marta Ziemianowicz, Marta Strzelec & Marta Sienicka 


[BSD Magazine]: Hello Aleksandar, thank you for finding some 
time for us. How have you been doing? Can you introduce 
yourself to our readers? 


[Aleksandar S. Sokolovski]: Hi | am Alex, Co-founder of Sci Bi 
(data science solution startup), based in UK (we could expand our 
business with offices in Canada / USA). 


From my alma mater | have a Bachelor of Science in Computer 
Science and Master of Science in Software Engineering and later 
SC Es] on | got my Master’s from the University of Sheffield (MSc in TIE - 
Technology, Innovation and Entrepreneurship). It was a hybrid pro- 
gramme in Computer Science and Business Administration, intended for future manager of tech / 
service / internet based companies. Later on | got a few certificates, including my pending certifi- 
cate in Executive Leadership from the University of Cornell (eCornell since it is officially online cer- 
tification programme). From my work experience, | got my first job one week after my Bachelor’s 
degree in Computer Science as a teaching assistant at a University (1 was the first generation of 
Computer Science students in my country). | worked at the University for about 5 years, then | 
went to work for a telecommunications and internet provider for about 3-4 years. We officially 
started our company Sci Bi 2015. From my personal life | am techie, MMO gamer and Sci Fi 
geek. 





[BSD Mag]: Can you tell us something about Sci Bi? 


[AS]: The story of Sci Bi is, well, | believe, as unique and interesting as any other startup com- 
pany founding story. Sci Bi is officially founded by three founders (Me, Dena and Natalie). To tell 
the story about Sci Bi, | need to start with how the three of us met. | met Dena while | was getting 
my Master’s from the University of Sheffield, we both met with Natalie a few years later. 
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First the story about Dena; as | said, | play a lot of MMO games, most of the games before 2009- 
2010 were browser based MMO games., While playing one of the war games, my group got into a 
virtual “war” with another group. We were at virtual war for months and one of the leaders from 
the group reached out to meet, to discuss peace terms; it was Dena. She messaged everyone 
from our group leaders and | was the only one who responded to her messages; since she said 
she was a girl and was sick and tired of waiting for us boys to do anything, she kinda reminded 
me of my sisters (with her strong female voice not afraid to say and get what she wants, “bossy” 
is the best word to describe her, it is a positive compliment from my side, since | believe and know 
females can be great leaders as well as males). After | got talking with Dena, we reached our 
peace terms for our MMO groups and me and Dena exchanged emails and later on played a few 
other MMO games where we “rocked” at it. We were even then both data geeks or data natives, 
and we did some browser scripts to enhance our groups’ game play, we were both very happy 
when that “primitive” (primitive for now days) script worked. That is when we realized we can do 
this for real world problems not just games. We exchanged “real names” after a year or so and 
since she was in a math master program at the same UK university as me, we met in person and 
worked on a real university project, she has bachelor in mathematics and we did some work to- 
gether on data analytics. Based on my work with Dena, for my second Master's in Software Engi- 
neering, | did my thesis on intrusion detection systems, using data mining and analytics, and after 
it | never stopped working on data science and big data since (but it was not called data science 
or big data back in 2010). 


Second, the part how Dena and | met Natalie. After our university days, Dena and | stayed in 
touch as geek friends and we went together to alumni events including one Sci Fi University festi- 
val, where we met Natalie. There was some Sci Fi movie, in it there was a female character really 
not done right as a strong female, but more like a “damsel in distress” character that needs her 
prince to save her. Natalie started booing at the screen in the middle of the movie screening and | 
supported that and started booing, as well, but her first reaction was that | was booing at her for 
being “too female empowerment”, hence Natalie started arguing with me and then Dena got into it 
as well arguing with her that | was only supporting her argument, not booing against her. Natu- 
rally, we got kicked out from the movie screening and we continued our heated argument outside 
of the cinema, for a few more hours. After it we agreed and disagreed on some points about many 
sci fi topics including time travel. We agreed to meet again and watch the entire movie and dis- 
cuss it. We got together online a few times a week to talk about movies and other stuff. Dena and 
| play Star Wars The Old Republic MMO game, we even got Natalie to play and it she liked it. 


Games, multiplayer, MMO and the “online” world are a great part of our company culture. | met 
one of Sci Bi’s lead software engineer at an E3 game convention; he challenged me to a game of 
Dragon Age Ing. He was victorious, and we are great friends and a good team now. Dena met 
Lyhn (she is in our Sales team) at a game tournament for Heroes of the Storm. 
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In the past month or so at the office when we stay to work over the weekend, our favorite game is 
Star Wars Battlefront and Heroes of the Storm, also a few racing games like Need for Speed, and 
one on one fighting, like Mortal Kombat. 


The last part is the story of how Sci Bi started. When Star Trek Into Darkness got released in 
2013, we got together to watch it and discuss it, and we all agreed that the female science charac- 
ter was classical “damsel in distress” (but JJ did great work on Star Wars VII with Ray). We real- 
ized that for any good “hero” in a movie you need a great “villain”. The same can be said for start- 
ups; for a good startup you need a great challenge, and data science was that challenge for us. 
After one year, we found a VC support and started our company officially in early 2015 (January/ 
February). The three of us are a great team since we complement each other. Natalie is market- 
ing / sales lead, Dena is COO material from start, and | am the tech geek who likes to “play” with 
new tech. There was a TV show “House” where the doctor used brainstorming as the main 
source and method for problem solving, plus he never liked working with someone who thinks 
like him and agrees with him on every topic and point. | am kinda like that, | need someone to 
challenge my ideas not always 100% agree with them. 


The three of us used brainstorming daily even if we are not in the same country / continent. A few 
months back, while | was traveling for business conference to Vienna (btw Vienna in one of my 
favorite cities in Europe, along with Warsaw and Frankfurt), we used slack to brainstorm about 
our new Al service with Dena and Natalie. Dena was, | believe, at the Cape Town University (that 
is in South Africa) for a science conference and Natalie was in New York for a marketing confer- 
ence. Natalie was officially the CEO for the past two years, | was recently appointed CEO a few 
months back, but it was because Natalie could help open and run the office in Canada / USA 
since most of our partners and projects are in USA and Canada. London in Canada will most 
likely be our main North America office, but | find Vancouver and the BC area very startup 
“friendly”. | believe Natalie and Dena are great leaders and | cannot wait for one of them to be the 
next CEO. | have great respect for female leaders, starting from my sister, Dena and Natalie, 
also. In reality, since all of us discuss most decisions, we all have executive partner as job title, 
since the CEO and other titles, at least for now, are only “official” they do not explain what all of 
us do. Dena mostly is doing the internal COO / CFO duties (operation and finance), Natalie deals 
with Partners (CSO / CPO). | work with the technical part of the company (innovation and product 
/ services development). But | do some of Dena’s work in finance related to costs and payments. 
Dena helps me with the Data Science reporting part and she does a great job on the data visuali- 
zation part of the data, Dena can do “real magic” with data her visualization. | work with Natalie 
on the marketing part, | mostly do competitors analysies, what others are doing. Marketing data 
from our site and other sources. Dena and Natalie both work together when we have a Marketing 
Event at a conference, Natalie does the “public” part of it, Dena does the “backend” organiza- 
tional part of the event. 
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Our biggest asset is that the three of us have different opinions about many topics and we 
complement each other and like to discuss / debate our points until, with brainstorming,, 
we can come to a compromise. | am first a software engineer, master in business second, 
Dena is a math geek first, master in business second. Natalie is a communications major 
first, master in marketing second. | believe our “hybrid” education helped us a lot in “see- 
ing” the same challenge from another angle / vantage point, this helps us a lot in our pro- 
ject work. 


If | did not understand business | might have being more concerned with what tech we will use for 
the new app / service, will it be Microsoft, Red Hat, open source, etc. 


As | both a business and engineering major former student, | think about the tech as well, but | 
also know that the average user does not see what tech | used, he can see only the end result, 
plus | would have never found out about MVP (most viable product) if | had not studied business, 
and MVP is better for creating online services, then building a product for months that no one 
might use later on in the future. With MVP, you get user input fast and that helps build products 
and services that someone will / might use. We still don’t have any public service available for 
any user, but it will change very fast. We are working on a great Al solution, that would help with 
everyday problems and management. 


[BSD Mag]: Your web page is very simple! The first interpretation that comes to my mind 
is “sterile”. 


[AS]: Yes, the web pages are bad, | agree. It was done years back while the company existed in 
the business plan. We wanted to give an intro into the data science with a simple and few lines of 
code solution called ARES; it does a fast search of NASA DB to find info on exoplanets (planets 
capable of sustaining life). We are still discussing the new page design, we will launch the new 
stuff when it is ready and right, deadlines are more guidelines for us. We want it perfect. We 
might launch the new site with our new first public service and the new Data Science blog. The 
blog will give new users insight into data science, and the public service will give a view to our po- 
tential customers and partners into what we can do, cause if a picture says 1000 words, an app 
says 10000 words. 


[BSD Mag]: Why data science? And what does it mean that you are a data scientist? 


[AS]: What is data science and a data scientist are very easy questions with complicated answer. 

Data Scientists are defined as people who are skilled in programming, math and statistical analy- 
sis, and data science is the field they work in. | find a simple answer for data science, for me, is 
using science tools to get useful insight / info from data. 
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| would also say that most data scientists | Know are also data natives or have a data native mind 
set towards the expectations from the apps they use and they create. 


For data natives Monica Rogati had a great article on that topic in recode link: 
http://www.recode.net/2014/4/10/11625490/the-rise-of-the-data-natives. 


So in short, the digital natives expect the app / tech to work and they program it, data natives ex- 
pect the app to work as well, but data natives also expect the app to learn on its own so they 
don’t have to program the app. As a data native, | expect my Al app to learn from data it has for 
me and to make the changes on its own, like what is my favorite song to play when | come at my 
desk. | expect the Al to learn my daily routines and do them for me, like checking the weather, 
sending a task summary, checking info on pending projects, check our slack channel chats from 
my team or company wide chats, remind me about events today, etc. Sending my summary of 
that info without the need for me to check all of that info on my own. My personal Al is called LEX, 
LEXA is our Sci Bi Al, used by all. LEX is the bleeding edge version, and LEXA is the stable re- 
lease version. 


[BSD Mag]: What is ARES? 


[AS]: We wanted to give an intro into the data science with a simple and few lines of code solu- 
tion called ARES; it does a fast search of NASA DB to find info on exoplanets (planets capable of 
sustaining life). ARES was hosted on our webpage, and still is. NASA created R packages that 
solved the search problems much, much faster, but for the time and given the terabytes of data in 
the NASA database, ARES was a great solution and great challenge for the time. We might cre- 
ate ARES 2.0, but it will be for some other purpose with NASA's large database, since large data / 
big data sets are our biggest challenge. 


[BSD Mag]: Do you use open source software in your solutions? 
[AS]: We are trying to have 100% open source solutions and all will be on git. 


The application | use most is Slack and our own build task manager SOPHIE, plus Gitlab and its 
wonderful management webadmin page and the gitlab slack integration. We do use MEAN JS 
http://meanjs.org/. This is mostly used for our partners or customer’s frontend apps; our backend 
apps that do the data science Al part use R (The R Project for Statistical Computing) 
https:/www.r-project.org/ with BSD and few times ubuntu servers. | find BSD and its file system is 
one of the best; since ubuntu started using it in the latest versions, | like what | see. We started 
using ubuntu server and CoreOs https://coreos.com/ because of our partners, but Linux com- 
pared to BSD had the advantage of having bigger and more active community, at least in the last 
few years, but BSD, especially PC BSD, will always get my “vote”, even compared to any Linux 
distro. | used GNOME as a main Desktop env, but when | started using Linux Mint, | moved to Cin- 
namon with PC BSD ver. 10, but | respect what BSD is doing with Lumina desktop. 
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| guess people will say it is still buggy, but it can become a great PC BSD desktop environment in 
some later version of 10 or version 11. 


| use Lumina on the PC BSD Virtual Machine for testing. In the past few months, | have been 
kinda looking for a good data visualization tool, but it must be open source, so | can change it 
however | like. For now, good options we use are SVG, d3.js https://d3js.org/ (under BSD Li- 
cence) and Dygraphs http:/dygraphs.com/ charts.JS http:/,www.chartjs.org/ (last two under MIT 
Licence). This web page is a great source for information related to Open Source 
https://opensource.com/ and Open Source Solutions and how to use them. 


[BSD Mag]: Which open source system is your favorite and why? 


[AS]: | can say R on PC-BSD with Firefox Browser since that is what | use daily for 90% of time, 
but my favorite open source system has to be our own slack add-on task manager named SO- 
PHIE. We named it after my sister and Dena’s favorite cousin Sophie. We used slack to do man- 
agers for a few years, but after the new year, we moved to SOPHIE. It is still in beta, but we plan 
to put the entire code or 99% of it on git. It might not be the exact version we are using, but the 
version will work and anyone can download it and use it and change it for them as they want. We 
use slack for everything (for most of our internal and partners communication) and SOPHIE is the 
greatest and most useful tool we had ever build. 


My dev team also uses Gitlab CE and the nice integration with slack. | get progress reports on 
Slack about recent projects without the need to be logged in to gitlab. And SOPHIE puts it all nice 
and tidy together so | can see the progress the way | want, when | want. 


| personally have a MacBookPro 15 2014 model (Apple has superior hardware compared to other 
vendors for laptops; | have an MBP from 2008 that still works). | used OS X, but | have dual boot 
with PC BSD. At work, as everyone else, | use System76 Oryx Pro laptops with Linux Mint 
(https:/www.linuxmint.com/ (but | have PC BSD as well). Linux Mint is very easy to use, even for 
the non tech team members, plus it has great support for games; games are great when you 
have some free time at the office. | personally believe as a team leader that it is not how much 
time you are at work that counts, it is important how much you work and how much you can get 
done. | don’t want people sitting at the office 9 to 5 doing nothing, it’s better to work only 4 hours 
but really work, and play games for the next 4 hours if you have the time, of course. We don’t 
have an official policy for work hours, except when we have meetings, and everyone is responsi- 
ble for his / her time. If you don’t do the work in the morning, do it in the afternoon or late at night. 
| usually come to work around 10-11 AM, stay until 21-22, sometimes 2-3 AM, if we have USA 
CANADA partner meetings, but | will do a nice game break, go to the movies and comeback. 


On Saturday evening, we play MMO team games like Star Wars Battlefront at the office. 
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In the past year, | did a lot of traveling across Europe; | spend more than 50% of the time in a dif- 
ferent city so | feel like all of Europe is my home, not just London and the UK. 


99% of our internal services are browser based, so it really makes no difference what OS we use, 
except maybe for Microsoft operating systems and using Internet Explorer. 


| prefer to use Firefox or Chromium (based on Google Chrome) because of privacy. 


[BSD Mag]: You started your career as a software developer. Have you ever been part of an 
open source community? Why have you decided to go into business direction? 


[AS]: In an official capacity or in open source community panels sadly no, for now at least. In high 
school, | was a math major and did some programming in Pascal and C in my free time. In 2003, | 
went to the University, and after | got my Bachelor’s degree, | enrolled for my first master within 
the same year, after that within the same year | enrolled into my second master, so from 2003 un- 
til 2013, | was working and being a student and being a gamer. From around 2013, we started 
working on our startup, so full-time schedule from 2003 until now. | helped out within the open 
source community as user, reporting bugs, helping out with bug reports and answering questions, 
but | was not part of an open source project team. It might be | still have not found the right pro- 
ject yet, but if | find a good open source project, | will spend time on it maybe within the BSD, 
CoreOS or Linux Mint community. 


If | don’t find an interesting open source project to join at the moment, we might start our own 
open source project since we have a lot of active projects we can put on git, like SOPHIE. 


While | was a teaching assistant in Computer Science, | saw the value of a cooperative open 
source community, but | realized that | wanted more proactive practical projects, not just theoreti- 
cal. Then | moved into the business world at the telecommunications provider Neotel in the data 
center, but sadly big companies cannot always use the latest research innovations in the work, 
even if they use high tech solutions. Neotel started using BSD for the first time for real chal- 
lenges. | had very wonderful years as a teaching assistant and lecturer at the University and an 
engineer in Neotel, | learned the value of teamwork in both and both places were great teaching 
places for me. 


Sci Bi is a good combination of both, like the best of both worlds; we use the high best tech solu- 
tions and use the newest that science can offer. | “love” my job and enjoy working with Dena, Na- 
talie and the entire Sci Bi team and because of it, | have no problem working 50-70+ hours, since 
it does not feel like work (at least most of the time). 


[BSD Mag]: You have been involved in many projects, you have many certificates, awards 
and publications. Also, you have graduated from many universities. Why? Do you find it 
important to gain more diplomas or is it about self-development? Do you think nowadays 
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Are they interested in gaining knowledge in a different fields and should they be or should 
their knowledge and experience be focused on one narrow field? 


[AS]: Yes, | have worked on many science and research projects (I still do). Most of the certifi- 
cates are because | found something interesting. | go to the course, and | find that when you 
know you are going for the certificate, you study more. Most of the certificates were for me not be- 
cause | need it for some project or work application. The latest Coursera course in 2015 for R pro- 
gramming and data science toolkit, | only took it to see how much that course will help my team to 
better understand data science, and Coursera and other educational platforms are great, | be- 
lieve. | personally believe in Knowledge sharing and my business degree thesis back in 2009 was 
investigating the value and potential of web tools in education. It was my belief at the time that on- 
line educational platforms that offer blogs, forums and access for everyone will become major 
educational contributors in the next 10 years. | was wrong, it become popular within in 5 years 
(Coursera one of many). 


[BSD Mag]: You like MMO games! Which one have you been playing recently? How do you 
find time for that? 


[AS]: Yes, | believe | mentioned few times during this interview that games are “part” of our Sci Bi 
company, since we met with Dena during a game, and we found games to help out with stress at 
the office, and “killing” the down time at the office. We are a small team of about 10 people (most 
of us work remotely, me being one of them), | don’t know if with 100+ team we can do it the same 
way but | believe we will always have a game room / break room, maybe in few years VR head- 
sets for everyone, when they become part of every new and modern game. | am a Sci Fi geek 
and gamer for life and yes | am proud of it. 


[BSD Mag]: Are there any challenges your company is facing at the moment? Any plans 
for the future? 


[AS]: Our biggest challenge is to create the perfect publicly available service app web service 
backed by data science and Al (MVP was a great help here) before September, maybe sooner, 
and launch the data science blog. It will help us to better and more clearly present our solutions 
to our partners and the more users we have, the more input we will get from them. SOPHIE’s pub- 
lic version as a Slack add on might do just that. 


Yes, our company's biggest and ultimate argument between Dena, Natalie and me: what is better 
Star Wars (Dena), Star Trek (Natalie), or Stargate (me)? It’s our favorite topic for the pub / bar. | 
can say in this interview that they agreed with me about Stargate being best of Star Wars and 
Star Trek, but | won’t because | will never hear the end of it (plus George being Firefly fan and 
Lyhn being Battlestar Galactica fan, does not help me a lot, | would say). 
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My biggest challenge with my Al engine (LEX and LEXA) is, | am still waiting for an answer on the 
questions from the hitchhiker's guide to the galaxy question about life and all, and | don’t accept 
33 as the answer. 


[BSD Mag]: Do you have any piece of advice for our readers? 


[AS]: My advice is be open minded and try to see the other point of view even if you don’t fully 
agree with it, because finding great solutions requires a team, you cannot do it alone. Any future 
student that wants to work on great projects, you must learn to be a team player. If you want to be 
a good leader, you must learn to leave your ego at the door and compromise. | personally believe 
in leading by example, not by orders or authority. If | want something done, | start to do it first and 
the others join without the need to order them or boss them around. People can often find me 
bringing in coffee for everyone at the office like an intern. 


My motto in life is: Do what you love and you will never work a day in your life. 





Aleksandar S. Sokolovski is a data native | 
with years of experience in Software Engi- } 
neering, Teaching, Lecturing and Data Sci- 

ence. He is a Co-Founder of Sci Bi | 
(startup in offering data science solutions). 4 
His biggest asset is being a great mentor | 
and the ability and natural way of knowl- 
edge sharing, open source projects are | 
great interpretation of his knowledge shar- | 
ing philosophy in life. He finds his work in 

data science and machine learning to be 

| very similar to teaching, since most of it re- | 
|} quires training and teaching Al’s to use data sets and big data. : 





He has a great passion for data science and believe that trained Al can help us in our | 
| everyday tasks at the office and at home and will give us more quality time with our fami- | 
| lies. | 


| New web page: http:/Avwww.sci4bi.com 
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With the recent British referendum decision to exit from the 
Europe Union, one of the few opinion polls to correctly pre- 
dict the outcome was online. What impact is the Internet 
having on the established political order and is this a force 


for good? 


by Rob Somerville 


The UK is currently still in a state of demo- 
cratic shock after the surprise result to commit 
to Brexit. Certainly, the general consensus be- 
fore the result was announced was that it 
would be a close call, but the majority of peo- 
ple seemed to think that the result would be to 
remain. What is more surprising, though, is 
that the traditional polls and surveys generally 
all agreed with this. While it could be argued 
that there was no precedent to base a poll on 
and, therefore, this would undermine sound 
statistical analysis, there is a much stranger 
dichotomy at work here. The Internet is gener- 
ally considered the domain of the younger 
generation, who pretty much staunchly re- 
sided in the remain camp. The older genera- 
tion (particularly the over 50's), were very 
much in the Brexit camp. The demographic 
split was extremely polarised — rich versus 
poor, North versus South, Globalist versus Iso- 
lationist. Where the split was less clear was 
between the politically left and right; there 
were advocates for both positions across the 
political spectrum. So one could logically ex- 
pect the Internet community to be pro remain, 
and the poll to reflect this. But the exact oppo- 
site was the case. 
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While the UK tabloids took the position of 
leave, the broadsheets seemed to take the op- 
posite view. Even the campaign itself — a vi- 
cious demonstration of mendacity, dirty tricks, 
mud slinging and confusion — only addressed 
the superficial issues. Little focus was truly 
given in having an intelligent debate about the 
matter, as both sides rigidly stuck to their re- 
spective positions, repeating their superficial 
mantras until the voting public were _ thor- 
oughly sick with the whole PR circus. It was 
only online that the vista became more colour- 
ful, from the obvious tin foil hat brigade 
through to the dogmatists who held their posi- 
tion with an almost religious fervour. The 
usual online rules applied, of course, from troll- 
ing through to ad hominem attacks. While the 
mood was frequently ugly, and often the 
boundaries of self control, political correct- 
ness and good manners were trampled under- 
foot in the passionate race to win the argu- 
ment, at least people were able to share their 
true observations, opinions and experiences, 
rather than the sanitised and anodyne discus- 
sions that were managed by the traditional PR 
and communications sectors. 
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That is why | believe that the new battlefield 
for political credibility and votes is online. The 
politicians have dipped their toes in the water, 
and while various initiatives have attempted to 
engage with the Internet audience, as usual, it 
is too little too late. There are hundreds of 
thousands of long established political web- 
sites, blogs and online videos with millions of 
visitors and these sites have a unique culture 
and vision that is rooted in their own particular 
values, whatever they may be. The audience 
is committed, loyal and unequivocal in their 
trust, and in some cases this extends as far 
as financial support. These voices are trusted 
as they closely reflect the views and opinions 
of the target “consumer” far better than the dis- 
connected and out of touch politician. In short, 
the new kids on the block have not only estab- 
lished a genuine rapport, but have credibility 
and trust that other forms of media and com- 
munications are rapidly losing. Newspaper 
proprietors and television moguls bemoan the 
loss of their audience to new media, and like 
the music industry, refuse to accept the fact 
that the writing is on the wall. People are fed 
up with being fed a diet of sterile opinion that 
refuses to engage with the real issues as the 
message is passed through the filter of medi- 
ocrity. They would rather engage with like 
minded folk or even tackle the opposite camp 
head on and have a damn good argument 
about it, and maybe clarify a few ideas in the 
process. There is much good that can come 
from such communities, provided they are not 
just echo chambers and places to stroke each 
other’s ego. The biggest danger is that we 
end up with an army of keyboard warriors, 
that don't get off their bottoms and actually do 
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something in the real world to change things 
for the better. 


The real battleground however, is trust and 
credibility. Astroturfing is widespread online, 
where PR 


companies, organisations and individuals set 
up multiple sites to anonymously promote 
their point of view, pretending to be grass root 
activists. And this is the biggest problem on- 
line — while the erosion of trust has already un- 
dermined traditional politics and the media, 
the same disease threatens to destroy these 
groups. Acceptable use policies, moderation 
of commercial sites and to a small degree, 
censorship, all have their place, but how can 
you trust a site or community? Once you 
emerge from the box of middle ground consen- 
sus, how do you deal with divisive and conten- 
tious issues such as the EU referendum that 
have split families and will no doubt end up 
with a few casualties in the divorce courts? 


The biggest problem with new media is that it 
is far too easy to change your tune at the click 
of a button. This makes for a very transient 
and fluid set of ethics in the wrong hands, and 
while we have tools such as the Wayback Ma- 
chine, nothing compares to saving pages 
offline or via a screenshot. A major online UK 
broadsheet has been caught in the act spiking 
a negative story on one of the Conservative 
leadership candidates, and even the Wayback 
machine suffered from a bad dose of amne- 
sia. If it were not for a few netizens who took 
the time to document this, no doubt this bra- 
zen volte-face would have gone unnoticed. 
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The solution to this problem, as ever, is trans- 
parency. Especially financial transparency. It 
is very easy to distil the motives of the opinion 
maker once you follow the money. Where it 
becomes more difficult is in the arena of politi- 
cal and social opinion, and those that have 
great influence as opinion formers. A well pre- 
sented and engineered video, blog or website 
can be extremely deceptive. Taken in by qual- 
ity presentation, powerful argument and rheto- 
ric, it is no wonder that many are taken in by 
propaganda, and worse, still fall prey to radi- 
calisation. And sadly, while many are abandon- 
ing the traditional media hoping to find a more 
idealistic platform, the risks and potential for 
the ultimate breakdown of trust are so much 
greater. 
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